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HUMAN IMMUNODEFICIENCY VIRUSES CAUSING AIDS 
IN A NONHUMAN PRIMATE 
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60/057,606, filed September 4, 1997. 
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BACKGROUND OF THE INVENTION 

This invention concerns unique isolates of human immunodeficiency virus type 1 
(HIV-1 JC ) which are highly infectious in vivo and produce acquired immune deficiency 
syndrome (AIDS) in a nonhuman primate. 

The human immunodeficiency viruses types 1 and 2 (HTV-1, HIV-2) are retroviruses 
which have been implicated as the causative agents of acquired immune deficiency syndrome 
(AIDS) (Barre-Sinoussi et al. [1983] Science 220:868-871). While skepticism about the 
exact cause of AIDS has arisen (Duesberg [1991] Proc. Natl. Acad. Sci. 88:1575-1579), a 
large amount of data has accumulated which supports HIV as the biologic agent of this 
disease The most convincing evidence includes: mother to child transmission; transmission 
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via blood transfusion; and transmission via contaminated blood products (Curran et al. [1 984] 
N Engl J Med. 310:69-75). The development of disease in an animal inoculated with HIV- 
1 would provide confirmatory evidence for the etiology of AIDS. While AIDS-Iike disease 
has been recently demonstrated in HIV-2-infected baboons (Barnett et al. [1994] Science 
266'642-646), to date, no species, other than humans, has developed AIDS following 
infection with fflV-1 . Thus, the most appropriate animal model for HIV- 1 infection currently 
does not exist However, an effective alternative animal mode, has been developed usmg 
simian immunodeficiency virus (SIV) infection of macaques. This model has been 
extremely useful for investigating the pathogenesis of immunodeficiency virus infection and 
for the testing of vaccines and therapies (Gardner et al. [1994] in The Retroviruses, vol. 3 
[Levy, ed.] Plenum Press, New York). 

Experimental infection of nonhuman primates with human immunodeficiency virus 
type 1 (HIV-1) has been described for gibbons (Lusso et al. [1988] J. Immunol. 141:2467- 
2473) pig-tailed macaques (Barre-Sinoussi et al., supra), and chimpanzees (Alter et al. 
[1984] Science 226:549-552; Fritz et al. [1986] J. Virol. 58: 1 16-124). However, the 
development of disease (AIDS) has not been documented in these animals or in any ammal 
species infected with HIV-1 . For many years, chimpanzees have been a major focus m the 
development of animal models for HTV-1 infection and therapy. The ability to consistently 
Meet chimpanzees with several HIV-1 subtypes and to reisolate the virus over extended 
periods (Fultz et al., supra; Johnson et al. [1993] AIDS Res. Hum. Retroviruses 9:375-378) 
has made chimpanzees useful for testing vaccine candidates (Herman et al. [1990] Nature 
345:622-625; Fultz et al. [1992] Science 256:1687-1690; Girard et al. [1995] J. Virol. 
69:6239-6248). However, the lack of disease in HIV-1 -infected chimpanzees has raised 
concern over the relevance of these vaccine studies. 

The present invention provides HIV-1 isolates that are infectious to nonhuman 
primates such as chimpanzees and induce AIDS in inoculated nonhuman primates including, 
but not limited to, chimpanzees. Further, this invention provides for the first time a proper 
animal model for HIV-1 infection and for the development of AIDS. 
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A first aspect of the present invention are the isolated human immunodeficiency v,ms 
type 1 (HIV-1) isolate having the identifying characteristics of HIV-1 isolate JC (HIV-1 JC ) 
and assigned AIDS Reagent Program Catalog Number 3523 and the isolate having the 
identifying characteristics of HIV-1 isolate NC (HIV-1 NC ). These viruses are useful for the 
preparation of recombinant, attenuated and subunit vaccines, as well as for the preparation of 



challenge 
samples. 



stocks. Each is also useful in screening for the presence of HIV in biological 



A second aspect of the invention is a biological sample, e.g., a biological fluid or 
tissue, containing an HIV-1 having the identifying characteristics of HIV-l Jc or HIV-1 NC . In a 
particular embodiment of the invention, primate blood specimens containing the HIV-1 JC or 
the HIV-1 NC of the present invention were used to cause HIV infection and to induce AIDS m 
a nonhuman primate, as specifically exemplified by the chimpanzee. 

A third aspect of the invention are a biologically pure culture of host cells containing 
an HIV-1 having the identifying characteristics of HIV-l Jc and a biologically pure culture of 
host cells containing an HIV-1 having the identifying characteristics of HIV-1 NC . 

A further aspect of the invention are isolated DNA molecules which produce 
infectious HIV-1 having the identifying characteristics of HIV-1 , c or which encode an 
antigenic fragment thereof or having the identifying characteristics of HIV-l* or encode an 
antigenic fragment thereof. The substantially full length sequence for infectious molecular 
clone of HTV-1 JC is given in SEQ ID NO:l 1, and the corresponding sequence for the 
infectious molecular clone of HIV-1 NC is given in SEQ ID NO:12. 

An additional aspect of the invention provides isolated DNA encoding an HIV-1 
envelope protein having the amino acid sequence of SEQ ID NO:2. In a particular 
embodiment, the DNA sequence encoding an HIV-1 envelope protein contains the nucleoude 
sequence of SEQ IDNO:l. 
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A further aspect of the invention are a composition comprising an antigenic 
preparation derived from the HIV-1 JC of the invention and a composition comprising an 
antigenic preparation derived from the HIV-1 NC of the present invention. 

An additional aspect of the invention are pharmaceutical compositions comprising an 
immunogenic amount of an antigenic preparation derived from the HIV-1 JC of the invention 
in a pharmaceutically acceptable carrier or HIV-1 NC in a pharmaceutical^ acceptable carrier. 

A further aspect of the present invention is a kit for detecting the presence of HIV- 1 
antibodies, comprising an antigenic preparation derived from the foregoing HIV-1 JC or from 
HTV-1 NC as described herein. 

Another aspect of the invention is a method of inducing in a subject antibodies to the 
HIV-ljc or to HIV-1 NC of the invention, comprising the step of administering to the subject an 
immunogenic amount of an antigenic preparation derived from the HTV-l Jc or from the HIV- 
1 The induced antibodies can be harvested and find use, for example, as hybridization 
probes in methods for HIV scanning. 

A further aspect of the invention is a method of immunizing a subject against 
infection by HIV, comprising the step of administering to the subject an immunogenic 
amount of an antigenic preparation derived from the HIV-1 JC or derived from HTV-1 NC of the 
present invention. 

It is another aspect of the invention to provide a method for inducing acquired 
immune deficiency syndrome (AIDS) in a nonhuman primate, comprising the step of 
administering to said primate an effective amount of an antigenic preparation derived from 
the HTV-1 JC and/or HrV-l NC of the invention such that said primate develops AIDS. In a 
specific embodiment of the invention, HIV-1 JC induced AIDS in chimpanzees, e.g., in a 
chimpanzee referred to as C499. In a different embodiment of the invention, HIV-1 JC 
contained in a biological sample from a chimpanzee having AIDS (C499) induces AIDS in a 
second chimpanzee and in a different primate species. 
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It is another aspect of the invention to provide a method for inducing acquired 
immune deficiency syndrome (AIDS) in a nonhuman primate, comprising the step of 
administering to said primate an effective amount of an antigenic preparation derived from 
the HIV-1 NC of the invention such that said primate develops AIDS. In a specific 
embodiment of the invention, HIV-1 NC induced AIDS in chimpanzees, e.g., in a chimpanzee 
referred to as C534. In a different embodiment of the invention, HIV-1 NC contained in a 
biological sample from a chimpanzee having AIDS (C455) induces AIDS in a second 
chimpanzee and in a different primate species. 

A further aspect of the invention provides a biological fluid or tissue sample, obtained 
from a chimpanzee having HIV-l, c -induced AIDS, containing an antigenic HTV fragment for 
inducing AIDS in a nonhuman primate. 

A further aspect of the invention provides a biological fluid or tissue sample, obtained 
from a chimpanzee having MV-l NC -induced AIDS, containing an antigenic HTV fragment for 
inducing AIDS in a nonhuman primate. 

It is yet another aspect of the invention to provide a method for inducing AIDS in a 
nonhuman primate, comprising the step of administering to said primate an effective amount 
of a biological fluid or tissue sample, obtained from a primate having HIV-l JC -induced or 
HTV-l NC -induced AIDS, containing an antigenic HTV fragment for inducing AIDS in a 
nonhuman primate. 

The present invention further provides various vaccine formulations containing active 
immunogenic agents derived from the foregoing HrV-l JC> DNA encoding the HIV-1 JC , and 
DNA encoding antigenic fragments of the HTV-l JC ,from the foregoing HIV-1 NC , DNA 
encoding the fflV-l NC , and DNA encoding antigenic fragments of the HIV-l^. An antigenic 
fragment contains one or more epitopes which bind antibodies directed to the HIV-1 JC and/or 
HTV-1 NC of the invention. 



5 



WO 98/59074 



BRIEF DESCRIPTION OF THE FIGURES 



PCT/US98/12990 



Fig. 1 shows the CD4+ -cell decline and plasma virus loads in chimpanzee C455 
following transfusion with blood from C499. Immediately before and at various times after 
transfusion, blood was collected from C455 for in vitro analyses of CD4+ -cell levels and 
plasma virus loads. Absolute peripheral CD4+ cells in C455 showed a dramatic decrease by 
2 weeks post-transfusion. This rapid decline continued and, by 14 weeks after transfusion, 
the number of CD4+ cells decreased to 10/ul. These low cell numbers have been maintained 
to date (42 weeks post-transfusion). Plasma HIV-1 RNA loads in C455 showed high levels 
of virus present by two weeks after transfusion. Results obtained at the 4-week point suggest 
very high levels of virus replication in C455. However, HIV-1 levels appeared to be 
somewhat controlled by 5 to 9 weeks post-transfusion. The cutoff level of 10 4 equivalents/ml 
(— ) is the lower limit of the assay. 

Fig. 2 illustrates CD4+ -cell decline in chimpanzee C534 following transfusion with 
blood from C455. Immediately before and at various times after transfusion, blood was 
collected from C534 for in vitro analyses of CD4+ -cell levels. Absolute peripheral CD4+ 
cells in C534 showed a dramatic decrease by 20 days post-transfusion. 

Figure 3 illustrates the strategy used in PCR amplification of subgenomic fragments 
from HIV-1 and the location and orientation of primers on the viral genome are shown. All 
the primers were designed from the HTV-l^, nucleotide sequence and their coordinates are 
described below. PCR amplified fragments were cloned in TA vectors and the corresponding 
name designations for the recombinant plasmids are shown in bracket. The 5'-LTR- 
containing Apa I fragment amplified from HIV-1 JC PBMC genomic DNA was subcloned in 
pJC to generate plasmid pHrV-lx^ while the Apal-Ncol fragment containing the 5'-LTR 
region amplified from HTV-1 NC PBMC genomic DNA was subcloned in pHTV-l JC16 to 
generate pfflV-l NCJC chimeric plasmid. Plasmid pHIV-l NC7 was constructed by subcloning 
the ertv-containing Ncol-Xhol fragment amplified from HIV-1 NC genomic DNA to P HIV-1 NCJC 
chimeric plasmid. All recombinant plasmids (pHIV-l Jcl6 , pHrV-l NCJC , and pHTV-l NC7 ) lacked 
55 bp at the 5' end of the genome (U3 region) and all of U5 region in the 3' LTR region. 
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Figures 4A-4F show replication of cloned and uncloned HIV-1 isolates in con- A 
stimulated and unstimulated chimpanzee PBMC (cPBMC). Stimulated cPBMC infected with 
(Fig. 4 A) HIV-a LAV- lb and SF2 parental strains and the highly cytopathic DH12 isolate, 
(Fig. 4B) JC (uncloned) and JC16 (cloned) isolates of HIV-1, and (Fig. 4C) NC (uncloned) 
and NC7 (cloned) isolates of HIV. Unstimulated cPBMC infected with: (Fig. 4D) LAV- lb, 
SF2, and DH12; (Fig. 4E) JC, JC16; and (Fig. 4F) NC, NC7. Chimpanzee PBMC (1.1 x 10 7 ) 
in T-25 cm 2 were infected with 20 ng of either HIV-1 NC (uncloned), HIV-1 SF2 , HIV-l LAV . lb , or 
HIV-l DH i2 virus and incubated for a total of 17 days at 37°C. Supernatant aliquots were made 
on 3, 7, 10, 14, and 17 days post infection. Reverse transcriptase (RT) assays were performed 
as outlined hereinbelow. 

Figure 5 shows replication of HIV-1 isolates in chimpanzee monocyte-derived 
macrophages (MDM). Purified PBMC (6 x 10 6 /well) were used to obtain MDM. Ten ng of 
each of virus HIV-1 JC16 (molecular clone), HIV-1 JC (uncloned), HIV-1 NC7 (molecular clone), 
HIV-Inc (uncloned), HIV-1 SF2 , fflV-l LAV . lb , and HIV-1 DH12 ) was used for infection and on 
days 7 and 14 post infection supernatants were harvested. The mount of virus in the 
supernatants was determined using the p24 HIV-1 antigen capture ELISA (Coulter). 
Supernatants from control uninfected cultures are represented. 

DETAILED DESCRIPTION OF THE INVENTION 

The following definitions are given in order to provide clarity as to the intent or scope 
of their usage in the specification and claims. 

The term HIV-1 isolate JC or HIV-1 * or HIV JC or HIV-1 having the identifying 
characteristics of HIV-1 JC as used herein refers to the particular HIV-1 isolated from a 
chimpanzee (C499) that developed AIDS 10 years after infection with HIV-1 (Novembre et 
al. [1997] J. Virol. 71:4086-4091). The DNA sequence of the cloned HIV-1 JC is given in 
SEQ ID NO:ll. 
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The term HIV- 1 isolate NC or HI V- 1 NC or HI V NC or HIV- 1 having the identifying 
characteristics of HIV-1 NC as used herein refers to the particular HIV-1 isolated from a 
chimpanzee (C455) that developed AIDS after infection with HIV-1 JC (Novembre et al. 
[1997] J. Virol. 71:4086-4091). The DNA sequence of the cloned HIV-1 NC is given in SEQ 
ID NO:12. 

The term antigenic preparation of HIV-1 JC or antigenic fragment of HIV-1 JC as used 
herein refers to the whole viral particle of the HIV-1 JC or to a fragment thereof, wherein such 
fragment encodes at least one epitope or antigenic determinant. The term antigenic 
preparation of HIV-1 NC or antigenic fragment of HIV-1 NC as used herein refers to the whole 
viral particle of the HIV-1 JC or to a fragment thereof, wherein such fragment encodes at least 
one epitope or antigenic determinant. 

The term animal or subject as used herein refers to a mammal and, more frequently, to 
a primate. 

The term effective amount as used herein refers to the quantity of active ingredient 
necessary to effect in an animal a change in a specific biochemical or immunological 
parameter. For example, in a particular embodiment of this disclosure, an effective amount 
refers to the amount of HIV-1 administered to a subject such that viral infection and AIDS 
developed. 

The term antigenic amount as used herein refers to the quantity of active antigen 
necessary to effect an interaction with corresponding antibodies. 

The term immunogenic amount as used herein refers to the quantity of active antigen 
necessary to stimulate the immune system in response to a specific antigen. 

The term labeled is used herein to refer to the conjugating or covalent bonding of any 
suitable detectable group, including enzymes (e.g., horseradish peroxidase, p-glucuronidase, 
alkaline phosphatase, and p-D-galactosidase), fluorescent labels (e.g., fluorescein, luciferase), 
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and radiolabels (e.g., 14 C, m I, 3 H, 32 P, and 35 S) to the compound being labeled. Techniques 
for labeling various compounds, including proteins, peptides and antibodies are well known. 
See, e.g., Morrison, Methods in Enzymology 32B, 103 (1974); Syvanen et aL, J. Biol Chem. 
284, 3762 [1973]; Bolton and Hunter, Biochem. J. 133, 529 [1973]. 

The viruses of the present invention resulted from the development of AIDS in a first 
chimpanzee infected with HTV-1 for over 10 years and by the rapid development of 
immunosuppression in a second chimpanzee transfused with blood from the first chimpanzee. 
To investigate the ability of HIV- 1 to induce AIDS in non-human primates, a cohort of 12 
chimpanzees was inoculated with several strains of HIV- 1 at the Yerkes Center in the mid- 
1980s. A member of this cohort, C499, was described as part of a previously reported 
superinfection study (Fultz et al. [1987] J. Virol. 61:4026-4029) and was inoculated on three 
separate occasions with three different HTV-1 isolates: HIV-1 SF2 in 1985, HIV-1 LAI in 1986, 
and HIV-W in 1987. The first inoculation resulted in infection, as determined by positive 
virus isolation and persistent HTV- 1 -specific antibody response. Clinically, the animal 
remained healthy except for the development of thrombocytopenia and lymphopenia in 1988, 
which resolved without treatment (Fultz et al. [1991] J. Infect. Dis. 163:441-447). (All HIV- 
infected chimpanzees are maintained in Biosafety level 3 isolation facilities.) 

In 1993, after the resumption of yearly monitoring (monitoring of all HIV- 1 -infected 
chimpanzees at the Yerkes Center was suspended from March 1990 to May 1993), a decrease 
in the levels of platelets and CD4 + cells in C499 was observed (24,000 and 390/nl, 
respectively) (Table 1). Thrombocytopenia and CD4 + -ceil lymphopenia were persistent in 
this animal from this point onward. In addition, C499 displayed other significant clinical 
signs of disease. Beginning in March 1995, C499 developed chronic, intermittent diarrhea for 
which no enteric pathogens were identified and which was not resolved with antibiotic 
treatments. In September 1995, this animal developed acute fulminant diarrhea which was 
associated with large numbers of Blastocysts hominis and Balantidium coll At this time, 
CD4 + cells decreased to extremely low levels (minimum of 10/^1:2% of total T cells) (Table 
1), indicative of severe immunosuppression. Similarly, declines were observed in the levels 
of total lymphocytes and CD8* cells (Table 1). Treatment with fluid replacement and 
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antimicrobial and antiprotozoal therapy (doxycycline, ceftriaxone, enrofloxacin, gentamicin, 
and albendazole) resulted in the resolution of acute diarrhea within five days. 

Because this was the first chimpanzee to develop AIDS, treatment with antiretroviral 
therapies was not administered in order to more fully characterize virological, immunological, 
and pathological parameters in this animal. As the acute diarrhea was resolved with 
treatment, CD4*-cell levels rose to a maximum of 1 80/ul (4'/. of total T cells) but declined 
again. Subsequently, chronic, intermittent diarrhea resumed and continued unresolved. 
During this period, C499 exhibited no lymphadenopathy or wasting. However, beginning in 
the latter part of 1995 and extending into 1996, C499 developed progressive nonregenerative 
anemia (hematocrit levels of 37.5% in December 1995 and 27.2% in January 1996, with 
respective hemoglobin values of 12.1 and 8.4 g/dl; reticulocyte counts were 0.0% since 
November 1995). Due to progressive hematologic abnormalities, chronic diarrhea, and 
continued immunosuppression, the animal was euthanized in February 1 996. 

Concomitant with the decrease in CD4* cells was an increase in HIV-1 loads in 
plasma (Table 1). The increase in the level of the virus was detected in plasma samples 
dating from May 1993, when the CD4*-cell decline was first noted, but not before the 
suspension of monitoring in 1 990. These levels are significantly higher than those for five 
other chimpanzees at the Yerkes Center which received cell-free or cell-associated HW-1 LAI 
or HIV-l sn inoculations (all have undetectable plasma viral RNA levels and are not 
immunosuppressed). These results suggest that pathogenic effects began to occur sometime 
between 1990, when CD4 + -cell counts were at normal levels and viral loads were 
undetectable, and 1993, when alterations in the number of CD4* cells and significant virus 
loads were present. 

The ability to isolate virus from C499 varied since C499's first exposure to HIV-1 . In 
general, early after inoculation, HTV was easily isolated from the peripheral blood 
mononuclear cells (PBMC) of this animal; however, after several months, virus could no 
longer be isolated. From August 1988 until the suspension of monitoring in 1990, the virus 
was consistently isolated from C499 on a monthly basis. Subsequently, after the resumption 
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of monitoring in mid-1993, HIV-1 continued to be easily isolated from this animal. 
Quantitative titration of PBMC viral load following the development of acute diarrhea in 
C499 in September 1995 until the time of euthanasia revealed that 10* to 10 5 PBMC was 
consistently required for virus isolation. The immune response of C499 to HIV-1 infection 
was very strong up to the time of euthanasia. HIV-1 antibody endpoint titers (HIV-1 whole- 
virus enzyme-linked immunosorbent assay, Genetic Systems, Redmond, WA) ranged from 
51 200 to 204,800 since 1993. Because of the deteriorating condition of this animal and 
because of the severe decline in CD4* cells, it was hypothesized that the HIV-1 present in this 
animal had evolved to become more cytopathic for chimpanzee CD4* cells. Cocultivation of 
PBMC derived from C499 (obtained at the time of acute diarrhea) with uninfected 
chimpanzee PBMC (cPBMC) resulted in the isolation of a virus (HIV-1*) which induced 
syncytium formation in chimpanzee cells (Fig. 1 A-D). This characteristic has been 
previously described for only three HIV-1 isolates (Ghosh et al. [1993] Virology 194:858- 
864; Schuitemaker et al. [1993] J. Infect. Dis. 168: 1 140-1 147; Shibata et al. [1995] J. Virol. 
69:4453-4462), none of which were used for inoculation of C499. 

The virus strain derived from C499 (HIV-l*) and other HIV-1 isolates (HIV-1 LAI and 
HIV-W were tested for the ability to induce syncytium formation in cPBMC. Virus stocks 
were prepared in cPBMC (HTV-1 JC ) or in human PBMC (HIV-1 LAI and HIV-l sra ). Cells were 
incubated with virus overnight and were then washed. Cultures were examined daily for 
evidence of cytopathic effects. cPBMC four days after infection with an HIV-1 isolate (HIV- 
1 JC ) from C499 showed beginnings of syncytium formation and separated syncytia. cPBMC 
four days after infection with HTV-lu, lacked of syncytium formation, and normal cell 
clusters were present. cPBMC four days after infection with HIV-1 SF2 also showed no 
syncytium formation. The only virus to induce significant cytopathic effects in cPBMC was 
HTV-1 JC . All cultures were examined for 14 days following infection. All cultures, regardless 
of the virus used, became positive for virus replication by seven days postinfection. Thus, it 
is concluded that genetic changes which confer the ability to induce syncytium formation 
occurred in the virus present in C499. 
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To confirm that the virus present in C499 was different from the viruses used for 
inoculation, DNA prepared from HIV-l JC -infected cPBMC was used as a template in typical 
PCR assays with primers (forward, no. 384: 

5-CCCTTCGAAGAGGATATAATCAGTTTATGGGATCAAAGC3' [SEQ ID NO:9]; 
reverse, no. 383: 5'CCCTTCGAACTCTTCTTCTGCTAGACTGCCATT3' [SEQ ID NO: 10]) 
designed to amplify a 507-bp fragment of the env gene containing the VI and V2 regions. 
Genetic analysis of 16 HTV-1 JC V1-V2 clones obtained by ligation of the amplification 
products with the vector pGEM7ZF (Promega, Madison, WI) showed amino acid homologies 
of 80 to 84% with HIV-W 73 to 80% with HTV-1 SF2 , and 63- to 68% with HIV-l^. Thus, 
there appears to be considerable divergence between the virus present in C499 at the time of 
acute disease and the viruses used to inoculate this animal. This divergence is further 
illustrated in Fig. Tables 3A-3C, which show amino acid alignments of five HTV-1 JC clones, 
HIV-W, HIV- W and HTV-1 SF2 . Comparative analyses between the 16 HTV-l rc clones 
showed that amino acid homologies ranged from 81 to 96%, with no clones being identical. 
These results suggest that the virus population in C499 consisted of a large quasispecies. 
Furthermore, the data, when combined with the in vitro analyses described above, indicate 
that the virus adapted after years of replication and mutation, becoming more pathogenic for 
the chimpanzee. While no evidence of recombination is evident from analyses performed in 
this small area, the possibility of recombination cannot be ruled out for other portions of the 
HIV- 1 JC genome. 

Tables 2A-2C illustrate amino acid alignment of V1-V2 clones obtained from HIV- 
l JC -infected cPBMC and prototypes HIV-1 LA1 and HIV-W and fflV-1^. em clones ofHTV- 
1 JC encompassing the V1-V2 region were sequenced by the dideoxy chain termination method 
(Sequenase; Amersham Life Science, Arlington Heights, IL). With the Intelligences Suite 
of programs (Intelligences, Beaverton, OR), sequences of fflV-l, em> fragments were used 
to derive corresponding amino acid sequences. Deduced amino acid sequences from five of 
these clones (HIV JC 10, HIV JC 17, HIV JC 45, HIV JC 48, and HIV JC 55) were then aligned with the 
corresponding region in HIV-W HTvMsf* and MV-l„ isolates. Amino acid sequences of 
HTV-U ,HIV-1 SF2 and HTV-W were obtained from the Human Retroviruses and AIDS 
Database. 



12 



PCT/US98/12990 

WO 98/59074 

To investigate additional pathogenic effects of HIV- 1 infection on C499, tissue 
samples obtained by biopsy during the acute diarrheal stage (September 1995) and at 
necropsy were subjected to histopathologic^ analyses. First, examination of a peripheral 
lymph node (obtained from C499 in September 1995) revealed marked lymphoid depletion 
within the cortical area, with a few follicles remaining, as compared with a lymph node from 
an age-matched, uninfected chimpanzee, which lacked follicular development and had a very 
cellular cortex. To detect virus expression in tissues, in situ hybridization experiments with 
digoxigenin-labeled riboprobes encompassing the entire HIV-1 genome (derived from the 
HTV-1 BH10 molecular clone (Hahn et al. [1984] Nature 312:166-169) were used to probe 
formalin-fixed lymph node sections. Detection of bound probes was performed with a sheep 
anti-digoxigenin-alkaline phosphatase-labeled Fab monoclonal antibody and with Nitro Blue 
Tetrazolium-5-bromo^-chloro-3-indolylphosphate toluidinium as the substrate chromogen, 
according to previously described methods (Hirsch et al. [1995] J. Virol. 69:955-967). In situ 
hybridization studies of C499's lymph node demonstrated the presence of HIV-1 RNA in 
follicular trapping patterns, with few positively staining cells. These findings are similar to 
changes observed in HIV-1 -infected persons developing AIDS (Pantaleo et al. [1995] N. 
Engl. J. Med. 332:209-216). Control samples, which included sense riboprobes and lymph 
nbdes from uninfected chimpanzees, did not show any staining. Additional lymph nodes 
obtained from C499 at necropsy showed a similar depleted pattern. However, a few lymph 
nodes were not as depleted but did contain multinucleated giant cells which stained positive 
for the HIV-1 p24 antigen. These giant cells are often found in the lymph nodes of simian 
immunodeficiency virus (SIV)-infected macaques and are occasionally found in the lymph 
nodes in HIV-l-infected persons. 

Finally, histopathologic analysis of intestinal tissue from C499 revealed pathologic 
changes in the ileum, with significant blunting of the villi and intense infiltration of 
mononuclear cells and plasma cells. Examination of the intestinal mucosa at higher 
magnification revealed extensive infection with Cryptosporidium, which was present 
throughout the small intestine, lining the apical surfaces of intestinal epithelial cells. This 
organism is an AIDS-defining opportunistic pathogen (Centers for Disease Control and 
Prevention. 1992. 1993 revised classification system for HTV infection and expanded 
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Chance case definition for AIDS among adolescents and adu!. S . Morbid. Mom!. Weekly 
Rep. 41:1-19) and probably accounted for the chronic intermittent diarrhea and mtestinal 

pathology in C499. 

Most tissues obtained from C499 following euthanasia appeared grossly norm* with 
no lymphadenopaft, or splenomegaly. However, ft, spieen showed moderate congestion and 
,acked follicle development. Bone tnarnw specimens obtained a, eudumasi. showed some 
functional impairment in CTU granulocytt macrophage and CPU formation, although levels 
of CD4- cells were no, altered (Villinger e, al. [1997] J. Med. Prim.tol 26:181-189. Vrus 
was isolattd ion, all lymphoid org™ includir* inguinal, axilla, and tnesenterie lymph 
^es as well as from the and mymus. Vhus was also isoiaKd from me kidney and 
liver bu, not from the brain or cerebrospinal fluid. 

« the lime of acute diarrhea in C499, a blood transfusion was performed to determine 
ft. effecu of passage of vina from Ins animal to anuninfected chimpanzee. Forty milUlners 
of blood obduned by vempuncnn. from C499 was immediately dansted h«v».«sly m» 
C455 This chimpanzee, which was bred in capuvHy, v» seropositive for Epaein-Ban ■ vuus 
and cytomegalov,™ and was ^negative and PCR negative for mv prior „ ft. — 
Results of titration an.ysis of PBMC andpbsma from C4„ show ft. in ft. <0 
, x 1C 50% tissue culture infective doses (TCID,,) of virus was in PBMC and 2 X 10 TC1D,„ 
of™, was in plasma. Thus, C455 received a total of 3* lO'TOD^f vuus. Anaiysisof 
periphe*. CD4- c.,1 levels in C455 revetded a precipitous decline beginning by 2 we* after 
Jfusion, when absolute numbe, ofCD4* celis decree from U40 ft 20 ««d » 
„ Tbis decline continued, .aching a minimum value of .0 ceils,ul (1% of total T - W by 
Hweekspos-transfusion. Since ftis time, the total percentage of T cells that the CD4 

number of CD4« cells (20 c.Us/ul a, ft. Udes. time point). Uvels of penpheral CDS cells 
sfcowed an initio decline from 1,590 ,0 880 cells/ul in the first two weeks after tiansfus.0.. 

• , , u jua t« o m 0 CD8* cells/ul by 8 weeks after transfusion. 
However, these levels quickly rebounded to 2,010 CDS cells/ui oy 

Since this tin*, the number of CD8* cells in circulation has been maintained between 550 and 
4,320/ul, with the level in most recent sample being 840 cells/ul. 
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Because CD4 + -cell levels declined so rapidly, the level of virus present in C455 was 
investigated by quantitation of plasma HIV-1 RNA with the Chiron B-DNA assay (Fig. 1). 
Two weeks after transfusion, plasma viral loads were 2 x 10 7 RNA equivalents/ml and 
reached maximum levels by 4 weeks post-transfusion (6.2 x 10' RNA equivalents/ml). 
Following a decline (which corresponded with the development of anti-HIV-1 spec.fic 
antibody), plasma HIV-1 levels have been maintained at -1.1 x 10 s RNA equivalents/ml. 
Virus has been easily isolated from C455 at all times post-transfusion. Quantitative coculture 
of PBMC from C455 has shown that early after transfusion, virus could be isolated from as 
few as 10'PBMC, while at more recent time points, up to 10 s to 10< cells was required for 
virus isolation. In vitro analysis of virus isolated from this animal at several time pomts has 
shown that the ability to induce syncytium formation in cPBMC has been retained, further 
implicating this virus in the pathogenesis of CD4*-cell decline. Antibody responses to HIV-1 
have been moderate in C455. Enzyme-linked immunosorbent assay titers have been 
m aintained between 1,600 and 6,400 since four weeks post-transfusion. Clinically, C455 has 
appeared normal, except for episodic incidences of a rash on the chest and in the scrotum 
area. In addition, the animal has experienced no weight loss, lymphadenopathy, or anerma. 

This invention presents an HIV-1 infected chimpanzee that developed AIDS as 
defined by the Centers for Disease Control and Prevention classification system (CDCP 1992, 
supra) Progression of clinical disease (anemia, thrombocytopenia, and chronic diarrhea) m 
this animal was associated with several key findings, including the following: (0 the 
presence of a virus which is cytopathic for cPBMC in vitro and in vivo and is generally 
distinct from those used for inoculation, (ii) an increase in viral load; (iii) CD4 + -cell 
depletion, (iv) lymph node depletion, and (v) the presence of Cryptosporidium orgarusms m 
the intestine. It appears that the critical change(s) associated with clinical progress^ may 
have developed during the period in which C499 was not monitored (mid-1990 through mul- 
1993) The precipitous CD4*-cell decline concomitant with high viral loads displayed m 
C45 5 transfused with blood from C499, suggest that the HIV present in C499 has evolved to 
become more pathogenic for chimpanzee, The increased pathogenicity of a lentivirus after 
passage into a new host has been previously observed with the adaptation of SIV from sooty 
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mangabeys to pig-tailed macaques, resulting in the development of the acutely lethal strain 
SIVsmmPBjl4 (Fultz et al. [1989] AIDS Res. Hum. Retroviruses 5:397-409). 

Forty milliliters of blood obtained by venipuncture from C455 was immediately 
transfused intravenously into C534. This chimpanzee, which was bred in captivity, was 
seropositive for Epstein-Barr virus and cytomegalovirus and was seronegative and PCR 
negative for HIV prior to the transfusion. Results of titration analysis of PBMC and plasma 
from C455 show that in the 40 ml of blood, 1x10' 50% tissue culture infective doses 
(TCID 50 ) of virus was in PBMC and 2 x 10' TCID 50 of virus was in plasma. Thus, C534 
received a total of 3 x 10' TCID 30 of virus. Analysis of peripheral CD4* cell levels in C534 
revealed a precipitous decline beginning by one week after transfusion, when absolute 
numbers of CD4* cells decreased from approximately 1,450 to 700 cells/ul (Fig. 2). The total 
percentage of T cells that the CD4* population encompasses has remained constant at 
approximately 1 %, with only a slight rise in the absolute number of CD4* cells (20 cells/ul at 
the latest time point). Levels of peripheral CDS' cells showed an initial decline ui the first 
two weeks after transfusion. However, these levels quickly rebounded after transfus.on. 

Table 3 shows the antibody response in chimpanzee C534 following transfusion with 
blood from C455 . Immediately before and at various times after transfusion, blood was 
collected from C534 for in vitro analyses of antibody response. Plasma HTV-1 RNA loads m 
C534 showed high levels of virus present by two weeks after transfusion. Results obtamed at 
the 3 -week point indicate very high levels of virus replication in C455. The cutoff level of 
10 4 equivalents/ml is the lower limit of the assay. 

Because CD4'-cell levels declined so rapidly, the level of virus present in C534 was 
investigated by quantitation ofplasmaHTV-1 antibody response (Table 4). One week after 
transfusion, the plasma antibody titer was still 0. At two weeks after transfusion, the anubody 
titer rose to 400 and remained at 1600 at three and four weeks post-transfusion. Virus has 
been easily isolated from C455 at all times after transfusion. Quantitative coculture of PBMC 
from C534 has shown that early after transfusion, virus could be isolated from as few as 10 
PBMC while at more recent time points, up to 10' to 10* cells was required for V1 rus 
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isolation. In vitro analysis of virus isolated from this animal at several time points has shown 
that the ability to induce syncytium formation in cPBMC has been retained, further 
implicating this virus in the pathogenesis of CD4*-cell decline. Antibody responses to HIV-1 
have been moderate in C534. Enzyme-linked immunosorbent assay titers have been 
maintained between 1 ,600 and 6,400 since four weeks post-transfusion. Climcally, C534 has 
appeared normal, except for episodic incidences of a rash on the chest and in the scrotum 
area. In addition, the animal has experienced no weight loss, iymphadenopathy, or anerma 
(normal hematocrit levels). 

This invention presents HIV-1 isolates NC and JC which infected chimpanzees such 
that the HIV- 1 infected chimpanzees develop AIDS as defined by the Centers for Disease 
Control and Prevention classification system (CDCP 1992, supra). Progression of cluneal 
disease (anemia, thrombocytopenia, and chronic diarrhea) in this animal is associated w.th 
several key findings, including the following: (i) the presence of a virus which is cytopatluc 
for cPBMC in vitro and in vivo and is genetically distinct from that used for inoculaUon, (n) 
an increase in viral load; (iii) CW-cdl depletion, (iv) lymph node depletion, and (v) the 
presenceofC^^ 

change(s) associated with clinical progression develop during the period spanning 
approximately 3 to 5 years after HIV-1 infection. The precipitous CD4*-cell decline 
concomitant with high viral loads displayed in C534, transfused with blood from C455, 
indicates that the HIV present in C455 remains pathogenic for chimpanzees. 

In the past, relevance of the HIV-1 -infected chimpanzee as a model for vaccine 
evaluation was questioned due to lack of disease development. Tbe lack of an animal model 
wmchsuppompathogemcHW-1^^ 

development. Although the time of progression to disease (>10 years), the currently Imuted 

chimpanzees are deterrents to their widespread use in AIDS research, the potential usefulness 
of this model cannot be disregarded. The development of AIDS in C499, the fact that 
additional HIV-infected chimpanzees have depressed CD4*-cell counts (<500„1) and 
thrombocytopenia, and the rapid progression of the CD4*-cell decline in C455 support the 
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role, that this animal model provides in AIDS-related studies. The adaptation of HIV-1 from 
long-term chimpanzee infection to a pathogenic form provides a critical link for the 
adaptation of HIV-1 to growth in more readily available nonhuman primate species. The 
instant invention further contemplates the growth of HIV-1 * isolated from C499 and/or HIV- 
1 NC from C455 in chimpanzees as well as in pig-tailed or rhesus macaques. In addition, the 
present invention provides for continued biological and genetic characterization of HIV-1 JC 
and HIV-1 NC with further key insights into the pathogenesis of HIV-1 infection in humans and 
chimpanzees, for example, for the development of drugs and vaccines for the treatment and 
prevention of AIDS. 

As described above, chimpanzee C499 was initially infected with HIV-1 SF: in 1985 
[Fultz et al. (1986) J. Virol. 58, 1 16-124] and later inoculated with HIV- Wu ™* HIV- W 
in 1986 and 1987 respectively. Superinfection with HIV-l LAV , b (but not HIV-W) was 
demonstrated by restriction enzyme analysis of PBMC genomic DNA. At the time of disease 
development in C499, a virus isolate, termed HIV-1 JC , was obtained by co-culture of C499 
PBMC with normal cPBMC. At that time, sequence analysis of the V1-V2 region of env 
suggested that HIV-1 JC was most closely related to HTV-W Alsoatthetimeofchsease 
development, blood from C499 was transfused into an uninfected chimpanzee (C455) which 
resultedinadramaticd e clbeofCD4 + cellsby2week S po S ttrar,fusion. Thedepressed 
CD4 + cell count is still maintained to date in this animal. One month post transfusion 50 ul 
of plasma from C455 was used for in vitro infection of normal chimpanzee PBMC and the 
resultant virus was designated HIV-1 NC . 

To perform a more thorough analysis of the genetic makeup of the HIV-1 JC and HTV- 
lNC viruses, we constructed substantially full-length, infectious molecular clones as descnbed 
in the Examples and in Figure 3. Both clones lacked 55 nucleotides at the 5' end (5' LTR, U3 
region) and all of the U5 region in the 3' LTR. Several clones (representing both fflV-l JC and 
Hiv-l,*) which appeared to be the correct size were tested for biological activrfy by 
transfection of CEMxl74 cells. Supematants from transfected cells were used in RT assays 
t0 monitor virus production Two clones, one from each group (fflV-l, pClfl and HIV-1 NC 
[NC7]) were positive by RT and also showed massive syncytia formation (2 to 3 days post 
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— — C.nedDNM ( o,lo„edh y 

amplification with cPBMC as outlined hereinbelow. 

r in f. and NC7 were determined as described in 
The complete nucleotide sequences of JC16 and wer 

■ An in sfO ID NO- 1 1 and SEQ ID NO:12, respectively. 
Example 7, and the sequences are given inSEQIDNU.il v 

Z nomes were detennmed to be 9193 nt (JC16) and 9196 nt (NC7) in leng* and 

7 dine (ORFs) for all HIV- 1 -specific structural, regulatory and accessory 

^entandtreatedas — 

nucleotide identities of 92, A <W»- ^ ^ ^ ja6 

most divergent region between NC7 and JC 6 w ^ ^ 

P nv eene region relative to JC16. In the L m re^ 

Compare analyses — P— »— IC ' 6 ' Ncr ^ par ™;' 

, SF2 LAV andNDK(ThesequenceofLAV.lbr,a S no.,e<been 
bocuiatu.gvm.ses.SF2, LAV, ana r. f 

defined). When ob«ved upon directcompanso . rf K16-dW ( 
ftetetf vi IUS «tio»bindingfa C «,,s eql «n« S ( S .t«fo,AP...NF AT NF *B 

TTTaR CORE and the L, S -«NA sites « conserved (or had su,gle pom. 
I^poL^ons^.o-C^Naa.d.eNKF/NKB-^^ 

The deduced amino acid sequences for an osteins of BV-1. and BIV-l. — 

(DNASTAR .nc.. Mad.so, ^" vinIses (Tlble 4,. Based 

examine ^esbeweenHTV.. HIV ,c» P ^ 
up^ percent homology calculate Gag. Pol, V,f, Tat Rev. . 
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were mos. closely related .o LAV, with vpr and Nef bemg mos, closely related to SF2. In to 
case was it apparent that the NDK isolate was the origin of a protein sequence. Whtle most 
changes invoked amino acid point mutations, severai proteins of JClo and NC7 contained 
3^0 acid tnsertions or deletions relative to the parental inoculating strains. A closer 
^vsis of ammo acid alignments revealed that the percent homologies could be misleadmg 
with regards to the origin of the protein. Forexample, i»Tat,JC,6 andNC7 weremor. 
homologous to LAV than to SF2 (based upon percent homologies,. However, the IC 1 6 and 
NC7 Tat proteins contained 1 5 amino acid deletions with respect to LAV-similar to that 
present in the SF2 isolate. Similar findings were observed in Gag and Pol. 

Table 5 shows amino acid alignments of Gag, Nef, andEnv proteins from JC16, NC7 
»d the inoculating LAV, SF2, and NDK. The deduced amino acid sequences for Vie 

Gag (a, Nef (b), and Env pro«e»s of HTv-1 were determined using the Intelhgenetics u„. 
^programs (Intelligenetics, Beaverton, OR) and the CLUSTAL method (DNASTAR Inc^, 
Mim wo was used for the alignment of proteins. The *. isolates are shovm by to 
designations a, the beginning of ev„y line as LAI (HtV-l.,), 1C.6 (HIV- W NC7 (rUV- 
, J, and SF2 (HIV-l sn ). Dashes (-, denote ammo acid deletion while do. (., denote ammo 
Z identity- ^e numbers after me amino acid sequence on the right show the pcsttron of 
the right-mos, amino acid in the line. The functional domains on Gag (a) and Nef (b) are 
Seated. MBD (a, refers to membrane-binding domain while MTD (b, refers to memfcane- 
.argeting domain. The hypervari»b,. regions in the Env glycoprotein (c) are shown as V, 
V5 and the CD4 binding domain is shown above me sequenee. Gpl20 <SU, ts the outer 
surface membrane Env glycoprotein. The NDK Env, LAI Env, and SFS Env protern 

sequences, and 5 (NC7 sequel.) amino acid deletions in the m«nx protem (p.7, relative to 

SF2 isola* sequence. A, the C-terminal - - P— «" " 
virus had 12 amino acid deletion in p6 protein relative to LAV vims sequence (Table 5 . 
However, the capsid (p24) and nucleoid <p7> proteins were genera* well conserved 

including tit. cysteine — *■ *»«-« " 
JC1« and NC7 were present in p.7, p24, p7 andp6 peptides. The Lck bindmg domam 
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(proline-rich region) within Nef was well conserved with only one point mutation in LAI 
Lie 5) However, there were 4 amino acid insertions in the SF2 sequence relative to the 
other viruses at the N-terminal portion of Nef. Sequence analysis of eight other non- 
infectious clones (4 from JC and 4 from NC) confirmed observations made for the Gag and 
Nef deletions suggesting that these characteristics are a general property of the .ruses 
obtained from C499 and C455. 

The vas. major,* of pom. -ate deletions, and insertions in these clones, relative 
„ the pMrt viruses, were found in the env glycoprotein region (Tab.es 4 and 5). 
Aliment of the En, proteins revealed that Ml .he cys.eine residues resident « the protem 
werecOT s«rvedbe W «nu,evi re ses.U«rew. re a-of30pr.dic.edN.l Wl ed 

g ly cos,.a,ion sciences (Asn-X-Thr or Asn-X-Ser, ft, the parent srrains compared .27 
J C 7) and 29 (JC16) fb, .he progeny vhuse, Most of .he g.yeosylatio„ si,es res.de . ft. 

(25 for SF2, 24 for LAI and IC, and 23 for NQ. Glycosylauon of glycoprotems has e«, 
shoml . influence ft. — -pons. .oward vius infectic. As »'» 
< SU) glycoprotein conned ft. highest, of—. The V1-V2, V3, V , and V5 

iZLu. — >»• «■ - 3 ^ m " ioM " spe0,ively spK 0 T 

NC7 The V I-V2 region of JC16 ahd NC7 also contained insertions relative ,0 ft. ofter 

(NDK). The. were mu,tip.e amino rid ft**, in ft. V4 region and stngle mm** 
insertions in V5 regions of JC.6 andNC7 relative .o the parental strains. Bod. the CD4- 
bmding domain and the pro.eo.ydc cleavage site (REKK, . the SU/TM junction were 

perfectly conserved. 

The V3 region was the mos. bwrfn, of the hynemriable regions. While (he 
Stiains contained on,, » basic amino acids ,Arg, Lys, and H, positive., charged 
JC1S andNC7 HIV-. isolates had ,2 basic ahd 2 negative., charged residues (Asp 
Z*It» gave ti,«progeny. m sesa K ,positivecha I geof.O to d K en W e V3re g ,on 
and an overal. positive chaxg. of +1 (LAV) or « (SF2) compared .0 the parent* 
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strains Eight of the 10 resultant positive charges for JC and NC isolates are located between 
residues 10 and 27 of V3 (Fig. 3) compared with 5 of 9 (LAV) and 4 of 9 (SF2) in the same 
region At least within this region, JC and NC isolates seem to have a relatively high net 
positive charge of 4 (relative to SF2) and 3 (relative to LAV). Other researchers have shown 
that changes in basic amino acids in the middle portion of V3 loop (residues 10 to 27 in Table 
5) can alter the syncytium-inducing properties and phenotype of the virus [Bhattacharyya et 
al (1996) AIDS Res. Hum. Retroviruses 12, 83-90; De Wolf etal. (1994) AIDS Res. Hum. 
Retroviruses 10, 1387-1400; Okada et al. (1994) AIDS Res. Hum. Retroviruses 10, 803-81 1]. 

To study the biological activities of the cloned and uncloned viruses derived from 
C499 and C455, we conducted in vitro replication studies. Figures 4A, 4B and 4C shows the 
results of replication studies in mitogen-stimulated cPBMC. Two ofthe three viruses used 
for inoculation of C499, SF2 and LAV-lb, were able to replicate in stimulated cPBMC, albeU 
with different kinetics (Fig. 4A). The SF2 isolate grew very slowly and to low titers in 
cPBMC In contrast, the LAV-lb isolate grew very well and with rapid kinetics, with a high 
titer of virus already present by day 7 post infection. The SF2 isolate was unable to induce 
detectable syncytium formation in cPBMC. The LAV-lb isolate, under these condi.ons 
induced very few syncytia, in contrast to previously reported results [Watanabe et al. (1991) J. 
Virol. 65 3344-3348]. Included in these analyses was the DH12 isolate of HIV-1 [Shibataet 
al (1995) J Virol. 69, 4453-4462]. This primary isolate from a human has been shown to be 
highly cytopathic for cPBMC. While this virus quickly established infection in the stimulated 
cell population, it didnot grow to high titers. The numerous syncytia formed infection w^ 
DH12 could account for the lack of growth observed. The uncloned and cloned viruses of JC 
(Fig 4B) and NC (Fig. 4C) replicated to levels comparable to those of LAV-lb for the same 
period of time. The rates of replication for cloned and uncloned viruses were 
indistinguishable. 

The ability of HTV-1 vitus isolates to replicate insulated cPBMC was similarly 
^ Results of this -ay (Fig. *K 4E and 4F> showed iha. only the NC (cloned and 
uncloned) and the LAV-lb isolates of HTV-1 were capable of significant replicattonu, 
unstimulated cPBMC. Replication rates to, *ese viruses wet. less that, .0% those observed 
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in stimulated cPBMC. Additionally, the kinetics of virus production in unstimulated cells 
was much slower than that observed in stimulated cPBMC. Interestingly, JC (cloned and 
uncloned) isolates of HIV-1 failed to replicate in cPBMC, reflecting an inherent bologKa. 
difference between the JC and NC viruses. While the DH12 folate was able to rephcate m 

The SF2 isolate was unable to replicate in unstimulated PBMC. Virus recovered from the 
secular clones displayed the intrinsic relative properties exhibited by the viruses they 
were derived from. 

Report, of HIV-1 isolates able to replicate in chimpanzee macrophages has been 
eontrovercial. To investigate the ability ofth.se viruses and their respective clones to 
replicate in macroptages, we conducted fa vitro assays using purifred chimpan.ee mo^ „cyte- 
derived macophages (MDM). Figure 5 shows virus production in macropl»ses at 7 and 
days post infection. Both the HIV-, JC and NC virus isolates (cloned and ttnctaed) 
heated in MDM as determined b, the levels of P 24 antigen produced. However, the 
amotm. of vtrus produced by the HIV-W molecuk, clone a, 14 days post infectton was 

compare for the same penod of time. Among the viruses that were .opto for MDMJfflV. 

(Lloned, produced the leas, amount of virus. HTV-W.,. - »V-. m ,^o mfected 
MDM and produced virus which is consistent with previous observance made by 
GendelmanetaUGendelmanetaU™,),. M 65, 3853-3863, and Shibataetal 
, S bibatae,a,.(199 5 )./. Viro,. 69, 44534462] re^ctively. Unlike HTV-W me other 
parental virus, HTV-1„, did not replicate in MDM. 

Because the cloned viruses displayed such similar in vitro bi.logrc acuvities as the 

« m*. Genomic DNA was isolaed and nested PCR was used to ampltfy 

V3-V5 regions of HIV-1 envelope gene. These hypervariable regions of e*v gene were 

selecKd because most viral heterogeneity has been associated «* them. Eoual amounts of 

derivlfroms m da,ds(SF.6 2 s,b W eB3a,dZM, 8 sub.pea)or ft om I C16 m dNC7 
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modular HeCrophoresis of mese fragment on polyacryiamid. no,—* gels 

duplex,, Homoduplexes based on g enom,c DNA (HIV-I* or Ht* W — - 
bomodup ex 0~> - b— <UPP«) bands - « -paraHe _ and wore 
LJ«d.er^.l1.eHIV-,^V.,„,(b«o t H I V-,^., HC ,,h«= t od»p«x^ 

» berween the homodupiex and hcerodupLx bands ob.ained wim genomic DNA 
fcomodupfcxes. 7b. distance beU^en homoduple* and hceroduplex bands formed be^n 
J C,6=ndNC7molee» to e,ones-^« te , 1 b..be»«enba„dsfon»edfrom 

n ° 8 raJa,aunos tteS a m e ra ,ea steS sDNAb„,.beZM, 8 (sub OT eC 2 ,he„ ro d U p 1 ex 

bands had the slowest migration. 

M advance of es— shing a mode, system for AIDS in smaller nonhuman pnma.es 
„ me decreased cos. associaKd v*h me housing ar,d upkeep of smalier arumals Arnrnal 
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Antigenic fragments of the present invention are peptides which contain at least one 
epitope (antibody binding site) which binds antibodies which bind to at least one HIV-1 
isolate of the present invention. The antigenic fragments are preferably capable of inducmg 
an immune response when administered to a nonhuman primate. DNA encoding such 
antigenic fragments may be used to transform host cells to thereby produce such anfgemc 
fragments. 

Antigenic fragments may be identified by a variety of means. A protein from HIV-1 JC 
and/or from HIV-1 NC , such as an envelope protein, may be fragmented with a protease, and 
the fragments tested to determine whether or not various ones react with antiserum agamst the 
protein See, e.g., J. Robinson et al., Mol. Cell BiocHem. 21:23-32 (1978). Another technique 
is to synthesize peptides which are fragments of the entire protein and determine whether the 
individual fragments are recognized by neutralizing antibodies against the protem. See, e.g., 
J Gerin et al., in Vaccines 85: Molecular and chemical Basis of Resistance to Paras** 
Bacterial and Viral Diseases, 235-239 (Lemer et al., eds. 1985). Still another method useful 
for obtaining immunogenic fragments of a protein is by isolation and identified of 
monoclonal escape mutants. In this strategy, HIV-1 is produced in the presence of a 
monoclonal antibody to the virus. The only viruses which can grow under these condUums 
a* those with a mutation in the nucleotide sequence which codes for an epitope to wluch the 
monoclona! antibody bind, A mutant virus which grows under these conditions is referred to 
as the "monoclonal escape mutant." The monoclonal escape mutant is then sequenced and 
the mutant sequence compared with the nucleotide sequence of the HTV-1 JC isolate or the 
fflV-U isolate to find the specific location of the mutation. The mutation is located m a 
region which codes for a protective epitope, or an "immunogenic fragment." See, e.g., J. 
Lopez etal., J. Virol. 64:927 (1990). 

Antigenic preparations of the present invention are useful as reagents in immunoassay 
diagnostic studies of retroviruses. Immunochemical methods for detecting retrovnuses 
include, for example, immunofluorescence assays or irnmunoenzymatrc assays, 
immunofluorescence assays typically involve incubating, for example, serum from the subject 
t0 be tested with preparations of the pathogenic virus or fragments thereof. Immune 
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complexes formed are detected using either direct or indirect methods, for example, the use of 
antibodies to which fluorescent labels such as rhodamine or fluorescein have been coupled. 
Immunoenzymatic assays typically involve viral extracts or other antigen-containing 
compositions bound to a surface. Serum from a subject to be tested for the presence of 
antibodies directed against one or more antigens is contacted with the surface and, after a 
period of incubation, unbound substances are washed away. The presence of immune 
complexes is detected using antibodies labeled with an enzyme such as horseradish 
peroxidase, alkaline phosphatase, or beta-galactosidase, which is capable of converting a 
colorless or nearly colorless substrate into a highly colored product, or an enzyme which 
emits light in the presence of the proper substrate. The amount of product formed is detected 
visually, spectrophotometrically, or luminometrically and is compared to a similarly treated 
control. The presence of antibodies in biological fluids may also be detected by 
agglutination. Viral lysates or antigen compositions are used to coat, for example, latex 
particles. 

Diagnostic tests utilizing the present invention may be carried out in accordance with 
known techniques. Such techniques provide a method of detecting the presence of fflV-1 by 
detecting the presence of fflV-1 antibodies. Such methods comprise collecting an antibody- 
containing biological sample (e.g., blood, blood sera, blood plasma, cerebrospinal fluid, tissue 
samples) from the subject, contacting the sample with an antigenic preparation of the viral 
particles of the present invention as given herein, and then detecting the formation of a 
reaction product between the antibodies in the sample and the antigenic preparation. Any 
suitable assay format, such as radioimmunoassay (RIA) or enzyme-linked immunosorbent 
assay (ELISA) may be employed, in accordance with known techniques. See, e.g., 
Immunology: Basic Processes, 162-175 (J. Bellanti, [Ed.] 2d ed., W.B. Saunders Co. (1985). 

Also disclosed herein are kits for the detection ofHTV infection. Such kits comprise a 
container containing an antigenic preparation of the viral particles of the present invention, 
which may be lyophilized. The antigenic preparation may comprise, e.g., thefflV-1 
envelope protein (env) or the group antigen (gag) protein of the HTV-1 IC and/or HTV-1 NC of 
the invention. 
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A method of inducing antibodies to HTV-1 in a subject, as disclosed herein, comprises 
administering to a subject an immunogenic amount of infectious viral particles of the present 
invention. This method may be used to make polyclonal or monoclonal antibodies, which 
may be used in diagnostic assays. Suitable subjects include mammals (such as, for example, 
rats, rabbits, mice, and horses) and primates. The term primates is herein intended to 
encompass any members of the order Primata (for example, lemurs, mandrills, rhesus 
monkeys, macaques, and chimpanzees) and to include humans. Suitable subjects include 
those in which antibodies to HIV may be raised (e.g., rabbit, horse). 

In the above methods of inducing antibodies, viral antigenic preparations of the 
present invention may be combined with any suitable pharmaceutically acceptable carrier 
(such as sterile, pyrogen-free physiological saline solution, or sterile, pyrogen-free phosphate- 
buffered saline solution). The viral antigens are included in an effective immunogenic 
amount. The precise amount to be administered to a given subject is determined by 
techniques known in the art and will vary depending on the route of administration, the 
subject and the desired response. Administration to the subject may occur by any suitable 
route (e.g., by intramuscular injection, subcutaneous injection, intraperitoneal injection, or 
intravenous injection). The appropriate immunogenic dosage will depend upon the particular 
subject and the desired outcome. Techniques to determine a particular immunogenic amount 
of the viral particles of the present invention will be apparent to those of ordinary skill in the 
art. See, e.g., Johnson et al., Proc. Natl. Acad. Set. USA 89:2175 (1992). For example, the 
active agent (viral particles or preparations thereof) may be given in an amount of from 0.05 
to 50 ug per kg body weight (e.g., 0.5 or 1.0 ug per kg). 

The invention also provides for a variety of different vaccines based on the structures 
of the HIV-1 JC isolate and/or the HIV-1 NC of the invention and a method for vaccinating a 
population against HTV. Examples of active agents used for the preparation of a vaccine of 
the invention include the live attenuated HTV-1 JC and/or HIV-1 NC isolate, fixed whole virus, 
host cells expressing virus antigen, preparations of virus fragments, purified proteins, 
antigenic fragments of proteins and antigenic peptides which are derivatives of the antigenic 
fragments. According to the present invention, HIV-1 JC - and/or HIV-l NC -derived 
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compositions or vaccines are useful for preinfection immunization of primates as well as for 
postinfection (therapeutic) immunization of HIV-infected primates (see Fultz et al. [1989], 
Lemer et al., eds. Cold Spring Harbor, NY). 

Live attenuated HIV isolate JC virus (or HIV-1 NC virus) is prepared by serial passage 
of the virus in tissue culture or genetically altered by recombinant techniques, in accordance 
with known procedures. Fixed virus is made by contacting live virus (attenuated or 
unattenuated) to a suitable fixative, such as formalin. 

Preparations of viral fragments are made by lysing host cells, such as E. coli cells, 
transformed with a vector encoding an HIV-1 isolate of the present invention (or both) or a 
portion thereof. The lysate may be used in crude or partially purified form, or a particular 
viral protein (or antigenic fragment thereof) such as the envelope protein, can be purified to 
homogeneity and used as an active agent for a vaccine against HIV-1. Host cells such as 
yeast cells may be transformed with vectors of the present invention capable of expressing 
HIV-1 proteins, or antigenic fragments thereof, on the surface of the host cells, and the 
transformed host cells used as an active vaccine agent as such or fixed (e.g., with formalin) 
and used as an active agent. 

Antigenic peptides are selected from the group consisting of antigenic fragments of 
HIV isolate JC and/or NC proteins, such as the envelope protein, and the antigenic 
equivalents thereof (i.e., analogs or derivatives). Antigenic peptides may be chemically 
synthesized or produced by recombinant techniques. 

Viral antigenic preparations and cells producing viral antigens and/or fragments 
thereof may be formulated into immunogenic compositions as neutral or salt forms. 
Preferably, when cells are used they are of avirulent strains, or the cells are killed before use. 
Pharmaceutically acceptable salts include but are not limited to the acid addition salts (formed 
with free amino groups of the peptide) which are formed with inorganic acids, e.g., 
hydrochloric acid or phosphoric acids; and organic acids, e.g., acetic, oxalic, tartaric, or 
maleic acid. Salts formed with the free carboxyl groups may also be derived from inorganic 
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bases, e.g., sodium, potassium, ammonium, calcium, or ferric hydroxides, and organic bases, 
e.g., isopropylamine, trimethylamine, 2-ethylamino-ethanol, histidine, and procaine. 

The term "antigenic equivalents," as used herein, refers to proteins or peptides which 
bind to an antibody which binds to the protein or peptide with which equivalency is sought to 
be established. Antibodies which are used to select such antigenic equivalents are referred to 
as "selection antibodies" herein. Preferred selection antibodies are monoclonal antibodies 
which bind to HIV isolate JC and/or to HIV-l NO but not to prior isolates of HIV- 1 such as the 
HIV-1 isolates NDK, LAI and SF2, for example. 

One or more amino acids of an antigenic peptide sequence may be replaced by one or 
more other amino acids which do not affect the antigenicity of that sequence. Such changes 
can be guided by known similarities between amino acids in physical features such as charge 
density, hydrophobicity/hydrophilicity, size and configuration, For example, threonine and 
serine can be interchanged, or aspartic acid and glutamic acid, or leucine and isoleucine, and 
the like. 

Antigenic equivalents may be formed by modifying reactive groups within a natural 
sequence or modifying the N-terminal amino and/or C-terminal carboxyl group. Such 
equivalents include salts formed with acids and/or bases, particularly physiologically 
acceptable inorganic and organic acids and bases. Other equivalents include modified 
carboxyl and/or amino groups on the synthetic peptide to produce esters or amides, or amino 
acid protecting groups such as N-t-butoxycarbonyl. Preferred modifications are those which 
provide a more stable, active peptide which will be less prone to enzymatic degradation in 
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In a particular embodiment of the invention, polyclonal and/or monoclonal antibodies 
capable of specifically binding to a particular epitope of at least one HTV-1 isolate of the 
invention are provided. The term antibody is used to refer both to a homogenous molecular 
entity (monoclonal antibody) and a mixture (such as a serum product) made up of a plurality 
of different molecular entities (polyclonal antibody). Monoclonal or polyclonal antibodies, 
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and preferably monoclonal, specifically reacting with a particular epitope of interest can be 
made by methods known in the art. See, e.g., Harlow and Lane (1988) Antibodies: A 
Laboratory Manual, Cold Spring Harbor Laboratories; Goding (1986) Monoclonal 
Antibodies: Principles and Practice, 2d ed., Academic Press, New York. Also, recombinant 
immunoglobulins may be produced by methods known in the art, including but not limited to 
the methods described in U.S. Patent No. 4,816,567, incorporated by reference herein. 
Monoclonal antibodies with affinities of 10 8 M' 1 , preferably 10' to 10'° or more are preferred. 

For use as a vaccine, immunogenic compositions may be formulated by any of the 
means known in the art. Such vaccines are typically prepared as injectables, either as liquid 
solutions or suspensions. Solid forms suitable for solution in, or suspension in, liquid prior to 
injection may also be prepared. The preparation may also, for example, be emulsified, or the 
protein encapsulated in liposomes. Such vaccines may be administered to the subject by any 
suitable means, for example, by intramuscular injection, by subcutaneous injection, by 
intravenous injection, by intraperitoneal injection, by oral injection, and by nasal spray. 

The vaccine or other immunogenic composition may be given in a single dose or 
multiple dose schedule. A multiple dose schedule is one in which a primary course of 
vaccination may include 1 to 10 or more separate doses, followed by other doses administered 
at subsequent time intervals as required to maintain and or reinforce the immune response, 
e.g., at 1 to 4 months for a second dose and, if needed, a subsequent dose(s) after several 
months. 

The immunogenic peptide antigen compositions are administered in a manner 
compatible with the dosage formulation and in such amount as will be prophylactically and/or 
therapeutically effective. The quantity to be administered, which is generally in the range of 
about 100 to 1 ,000 jig of protein per dose, more generally in the range of about 5 to 500 ug of 
protein per dose, depends on the subject to be treated, the capacity of the individual's immune 
system to synthesize antibodies, and the degree of protection desired. Precise amounts of the 
active ingredient required to be administered may depend on the judgment of the physician 
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and may be peculiar to each individual, but such a determination is within the skill of such a 
practitioner. 

Vaccine formulations of the present invention comprise the active agent mixed with 
excipients or carriers which are pharmaceutically acceptable and compatible with the active 
ingredient. Suitable excipients include but are not limited to water, saline, dextrose, glycerol, 
ethanol, or the like and combinations thereof. The concentration of the immunogenic 
polypeptide in injectable formulations is usually in the range of 0.2 to 5 mg/ml. 

In addition, if desired, the vaccines may contain minor amounts of auxiliary 
substances such as wetting or emulsifying agents, pH buffering agents, and/or adjuvants 
which enhance the effectiveness of the vaccine. Examples of adjuvants which may be 
effective include but are not limited to: aluminum hydroxide; aluminum phosphate; plant and 
animal oils; synthetic polymers; e.g., N-acetyl-muramyl-L-threonyl-D-isoglutamine (thr- 
MDP); N-acetyl-nor-muramyl-L-alanyl-D-isoglutamine (CGP 1 1637, referred to as nor- 
MDP)! N-acetylmuramyl-L-al^ 

3-hydr'o X yphosphoryloxy)-ethylamine (CGP 19835A, referred to as MTP-PE); and RIBI, 
which contains three components extracted from bacteria, monophosphoryl lipid A, trehalose 
dimycolate; cell wall skeleton (MPL+TDM+CWS) in a 2% squalinertween 80 emulsion; etc. 

In addition, the vaccine formulations may also contain one or more stabilizer, for 
example, carbohydrates such as sorbitol, mannitol, starch, sucrose, dextrin, and glucose, 
proteins such as albumin or casein, and buffers such as alkaline metal phosphates and the like. 

In view of the similarities in protein molecules making up different isolates of HIV- 1, 
the skilled artisan understands that an antibody, particularly a monoclonal antibody, which is 
specific for a particular epitope directed to a particular protein of the HIV-1 JC isolate and/or 
the HTV-1 NC isolate, can be used to screen for other HIV-1 isolates having similar epitopes 
recognized by that (monoclonal) antibody. 
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Antibodies generated against specific epitopes of the HIV-1 JC of the invention are 
useful, for example, as probes for screening DNA expression libraries or for detecting the 
presence of HIV-1 strains in a test sample. Antigens can be synthesized and conjugated to a 
suitable carrier protein (e.g., bovine serum albumin or keyhole limpet hemocyanin) for use in 
vaccines or in raising specific antibodies. Frequently, the polypeptides and antibodies will be 
labeled by joining, either covalently or noncovalently, a substance which provides a 
detectable signal. Suitable labels include but are not limited to radionuclides, enzymes, 
substrates, cofactors, inhibitors, fluorescent agents, chemiluminescent agents, magnetic 
particles and the like. United States Patents describing the use of such labels include but are 
not limited to Nos. 3,817,837; 3,850,752; 3,939,350; 3,996,345; 4,277,437; 4,275,149; and 
4,355,241. 

Immunogenic carriers may be used to enhance the immunogenicity of an active agent. 
Such carriers include but are not limited to proteins and polysaccharides, liposomes, and 
bacterial cells and membranes. Protein carriers may be joined to the capsular polysaccharide 
molecules to form fusion proteins by recombinant or synthetic means or by chemical 
coupling. Useful carriers and means of coupling such carriers to polypeptide antigens are 
known in the art. The art knows how to administer immunogenic compositions so as to 
generate protective immunity where immunity is most helpful. 

Compositions and immunogenic preparations including vaccine compositions 
comprising substantially purified antigens derived from an HIV-1 isolate JC and/or NC and a 
suitable carrier therefor are provided. Immunogenic compositions are those which result in 
specific antibody production when injected into a human or an animal. Such immunogenic 
compositions are useful, for example, in immunizing primates against infection by HIV-1 
strains. The immunogenic preparations comprise an immunogenic amount of, as specifically 
exemplified, at least one antigenic determinant derived from the HIV-1 JC isolate of the 
invention and a suitable carrier. Alternatively, the immunogenic composition can comprise 
host cells harboring an antigenic agent from the specifically exemplified HTV-1 JC strain and a 
suitable carrier. It is understood by one of ordinary skill in the art that a functionally 
equivalent, recombinant mutant of HTV-1 JC and or HIV-1 NC can be produced by the 
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introduction of the cloned DNA containing the insertion mutations responsible for a desired 
characteristic. It is also within the scope of the present invention and readily within the grasp 
of the ordinary skilled artisan to generate other types of genetically stable mutations in the 
structural or enzyme genes of ffiV-1. Such immunogenic compositions (or vaccines) are 
useful, for example, in immunizing an animal, especially humans, against AIDS and related 
diseases resulting from infection by HIV-1 species. Such immunogenic compositions can 
also elicit the production of antibodies which will cross react with proteins of other HIV-1 
and HIV-2 strains expressing epitopes in common with those of the starting HIV-1 JC isolate. 
It is understood that where whole cells are formulated into the immunogenic composition, the 
cells are preferably inactivated, especially if the cells are of a virulent strain. Such 
immunogenic compositions may comprise one or more protein or the immunogenic cellular 
component. By "immunogenic amount" is meant an amount capable of eliciting the 
production of antibodies directed against an antigenic agent of HIV- 1 « in an animal or human 
to which the vaccine or immunogenic composition has been administered. 

The nucleotide sequence of the fflV-l JC isolate or the HIV-1 NC isolate can be used to 
generate hybridization probes which specifically bind to HIV-1 JC genetic material, or to DNA 
ofHIV-1 isolates having the identifying characteristics of the HIV-1 isolates JC or NC, to 
determine the presence of such HIV-1 in primates. The hybridization probe may be selected 
so that it does not bind to other known HIV-1 isolates such as NDK, LAI, SF2, etc. The 
hybridization probes can be cDNA fragments or polynucleotides and may be labeled with a 
detectable group, as is well-known in the art. Pairs of probes can serve as PCR primers for 
synthesis and amplification processes in accordance with the description, for example in U.S. 
Patent Nos. 4,683,202 and 4,683,195. 

In specific embodiments, probes of the invention comprise DNA sequences of HIV- 
1 JC or sequences encoding antigenic fragments thereof or sequences having identity thereto. 
In particular are provided probes having a DNA sequence as set forth in SEQ ID NO:l or a 
sequence having identity thereto. 
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The production of DNA, vectors, transformed host cells, HIV-1 virus, proteins, and 
protein fragments of the present invention by genetic engineering techniques can be carried 
out in accordance with methods known in the art. See, e.g., U.S. Pat. No. 4,761,371, U.S. 
Pat. No. 4,877,729, U.S. Pat. No. 4,912,038, and U.S. Pat. No. 4,879,224, among others. 

A nucleotide sequence (polynucleotide) or fragment thereof is substantially 
homologous (or substantially similar) to another polynucleotide if, when optimally aligned 
(with appropriate nucleotide insertions or deletions) with another polynucleotide, there is 
nucleotide sequence identity for approximately 80% of the nucleotide bases, usually 
approximately 90%, more preferably about 95% to 100% of the nucleotide bases. Gaps 
introduced to optimize alignments are treated as mismatches. 

Alternatively, substantial homology (or similarity) exists when a polynucleotide or 
fragment thereof will hybridize to another polynucleotide under selective or stringent 
hybridization conditions. Selectivity of hybridization exists under stringent hybridization 
conditions which allow one to distinguish the target polynucleotide of interest from other 
polynucleotides. Typically, selective hybridization will occur when there is approxxmately 
75% similarity over a stretch of about 14 nucleotides, preferably approximately 80% 
similarity, more preferably approximately 85% similarity, and most preferably approximately 
90% similarity. See Kanehisa (1984) Nucl. Acids Re,, 12:203-213. The length of homology 
comparison, as described, may be over longer stretches, and in certain embodiments wnl 
often be over a stretch of about 17 to 20 nucleotides, preferably 21 to 25 nucleotides, more 
preferably 26 to 35 nucleotides, and more preferably about 36 or more nucleotides. 

The hybridization of polynucleotides is affected by such conditions as salt 
concentration, temperature, or organic solvents, in addition to the base composition, length of 
the complementary strands, and the number of nucleotide base mismatches between the 
hybridizing polynucleotides, as will be readily appreciated by those skilled m the art. 
Stringent temperature conditions will generally include temperatures in excess of 30°C, 
typically in excess of 37°C, and preferably in excess of 45°C. Stringent salt conditions wnl 
ordinarily be less than 1 M, typically less than 500 mM, and preferably less than 200 mM. 
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However, the combination of parameters is much more important than the measure of any 
single parameter (Wetmur and Davidson [1968] J. Mol. Biol. 31:349-370). 

An isolated or substantially pure polynucleotide is a polynucleotide which is 
substantially separated from other polynucleotide sequences which naturally accompany a 
native sequence. The term embraces a polynucleotide sequence which has been removed 
from its naturally occurring environment, and includes recombinant or cloned DNA isolates, 
chemically synthesized analogues and analogues biologically synthesized by heterologous 
systems. 

A polynucleotide is said to encode a polypeptide if, in its native state or when 
manipulated by methods known to those skilled in the art, it can be transcribed and/or 
elated to produce the polypeptide of a fragment thereof. The anti-sense strand of such a 
polynucleotide is also said to encode the sequence. 

Vectors are replicable DNA constructs used to either amplify or express DNA of the 
present invention. An expression vector is a replicable DNA construct in which DNA of the 
present invention is operably linked to control sequences capable of expressing that DNA m a 
suitable host. Generally, control sequences include a transcriptional promoter, an optxonal 
operator sequence to control transcription, a sequence encoding suitable mRNA ribosomal 
binding sites, and sequences which control the termination of transcription and translation. 
Suitable vectors include plasmids, viruses (eg., vaccinia virus, adenovirus, baculovtrus, 
cytomegalovirus) phage, and intertable DNA fragments (i.e., fragments intertable mto the 
host genome by recombination). 

DNA regions are operably linked or operably associated when they are functionally 
related to each other. For example, a promoter is operably linked to a coding sequence if tt 
controls the transcription of the sequence; or a ribosome binding site is operably baked to a 
coding sequence if it is positioned so as to permit translation. 
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Transformed host cells are cells which have been transformed or transfected with 
vectors as described above. Transformed host cells ordinarily express the DNA of the present 
invention. Suitable host cells include prokaryote, yeast or higher eukaryotic cells such as 
mammalian cells and insect cells. 

Prokaryote host cells include gram negative or gram positive organisms, for example 
EscHericMa coli (E. coW or Bacilli. Exemplary host cells are * coli W3 11 0 (ATCC 27,325), 
KcoHB E C o/ I X1776(ATCC 31,537),£c 0 /«294(ATCC31 ) 446). A broad variety of 
suitable prokaryotic and microbial vectors « available. E. coli is typify transformed usmg 
pB R322 Promoters most commonly used in recombinant microbial expression vectors 
include the P-lactamase (penicillinase) and lactose promoter systems (Change et a.., Nature 
275-615 [1978]; and Goeddel et al., Nature 281:544 [1979], a tryptophan (up) promoter 
system (Goeddel et al. [1980] Nuc. Acids Re, 8:4057, and EPO App. Publ. No. 36,766) and 
the tac promoter (H. De Boer et al., Proc. Natl. Aca, *L USA 80:21 [1 983]). Tta promoter 
and Shine-Dalgamo sequence are operably linked to the DNA of the invention, i.e., they are 
positioned so as to promote transcription of messenger RNA from the DNA. 

Eukaryotic microbes such as yeast cultures may also be transformed with vectors of 
the present invention. See, e.g., U.S. Pat. No. 4,745,057. SaccHaromyces cerevisiae . the 

cells (e g cultured Spodopterajrugiperda cells) and expression vectors such as the 
baculovirus expression vector may be employed in carrying out the present invenUon, as 
described in U.S. Pat. No, 4,745,051 and 4,879,236 to Smith et al. 

Examples of useful mammalian host cell lines are VERO and HeLa cells, Chinese 
namster ovary (CHO) cell lines, and WI138, BHK, COS-7, CV, and MDCK cell lines. The 
transcriptional and translation* control sequences in expression vectors to be used m 
^forming vertebrate cells are often provided by viral source, For example^ commonly 
used promoters are derived from polyoma, Adenovirus 2, and Simian Vrrus 40 (SV40). See, 
eg US Pat. No. 4,599,308. An origin of replication may be provided either by cor— 
ofthe vector* include .exogenous origin, such as may be derived from SV40 or other vtral 
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(e g Polyoma, Adenovirus, VSV, or BPV) source, or may be provided by the host cell 
chromosomal replication mechanism. If the vector is integrated into the host cell 
chromosome, the latter is often sufficient. Rather than using vectors which contam vual 
origins of replication, one can transform mammalian cells by the method of cotransformaUon 
with a selectable marker and DNA of the present invention, as described in U.S. Pat. No. 
4,399,216. 

Except as noted hereafter, standard techniques for cloning, DNA isolation, 
amplification and purification, for enzymatic reactions involving DNA ligase, DNA 
polymerase, restriction endonucleases and the like, and various separation techmques are 
those known and commonly employed by those skilled in the art. A number of standard 
techniques are described in Sambrook et al. (1989) Molecular Cloning, Second Edmon, Cold 
Spring Harbor Laboratory, Plainview New York; Maniatis et al. (1982) Molecular Cloning, 
Cold Spring Harbor Laboratory, Plainview, New York; Wu (ed.) (1993) Meth. Ervymol 
218-Part I- Wu (ed.) (1979) Meth. Enzymol. 68; Wu et al. (ed..) (1983) Meth. Enzymol. 100 
and 101- Grossman and Moldave (eds.) Meth. Emymol. 65; Miller (ed.) (1972) Experiments 
in Molecular Genetics, Cold Spring Harbor Laboratory, Plainview, New York; Old and 
Primrose (1981) Principles of Gene Manipulation, University of California Press, Berkely; 
Schleif and Wensink (1982) Practical Methods in Molecular Biology; Glover (ed.) (1985) 
DNA Cloning Vol. I and II, IRL Press, Oxford, UK; Hames and Higgins (eds.) (1985) Nucleic 
Acid Hybridization, IRL Press, Oxford, UK; and Setlow and Hollander (1979) Genetic 
Engineering: Principles and Methods,^, 1-4, Plenum Press, New York. Abbreviations 
and nomenclature, where employed, are deemed standard in the field and commonly used m 
professional journals such as those cited herein. 

All publications, patent applications and patents cited herein are incorporated by 
reference to the extent that they are not inconsistent with the present disclosure. 

The foregoing discussion and the following examples are provided for illustrative 
purposes, and they are not intended to limit the scope of the invention as claimed hereu. 
Modifications and variations which may occur to one of ordinary skill in the art are wnthun the 
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intended scope of this invention. The present invention i, further described in the non- 
limiting examples set forth below. 

EXAMPLES 



Example 1. Animal Subjects 

All animals (e.g., chimpanzees) were maintained in accordance with the gmdehnes 
established by the Animal Welfare Act and the NIH guide for care and use of laboratory 
arumals. The Yerkes Center is fully accredited by the American Association for the 
Accreditation of Laboratory Animal Care (AAALAC). 



A cohort of 12 chimpanzees was inoculated with several strains of HIV-1 at the 
Yerkes Center. A member of this cohort, chimpanzee 499 was inoculated on three different 
occasions 



„ with <h-« dtt-l MV-1 U~ «V-1„ in 1985, HIV-,., in 1986 «,d HIV- 
, in 1987 (See FultzM al. [1991] J. .-feet Dis. 163:441-447 »d Novemte «. ,1. [1997] J. 

ND1C 

Virol. 71:4086-4091). 

It is preferred that a chimpanzee be bred in captivity, be seropositive for Epstein-Barr 
virus and cytomegalovirus and be seronegative and PCR negative for HIV prior to being used 
as an animal model. 

Example 2. and Purification of HrV-l lc i§Ql3tg 

The original source of the HIV-1, isolate was the chimpanzee C499. HIV was easdy 
isolated from the peripheral blood mononuclear cells (PBMC) of this animal. Cocultivauon 
of PBMC derived from C499 with uninfected chimpanzee PBMC (cPBMC) resulted m *e 
isolation of a virus (HIV-l*) which induced syncytium formation in chimpanzee cells The 
nucleotide sequence of this virus was distinguished from the isolates used for the nutuu 
inoculations and from other known HIV-1 isolates. 

The HIV-1 JC isolate was deposited with the AIDS Reagent Program, McKesson 
Bioservices, 685 Lofstrand Lane, Rockville, MD 20850 USA, a division of the NIH AIDS 
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Research and Reference Reagent Program, in May 1997 and has been assigned Catalog 
Number 3523. A substantially full-length infectious molecular clone has a nucleotide 
sequence as given in SEQ ID NO: 1 1 . 

In a specific embodiment of the invention, probes of the invention comprise DNA 
sequences as set forth in SEQ ID NO:l 1 and/or sequences of at least 15 contiguous 
nucleotides derived therefrom or sequences complementary thereto. 

The original source of the HTV-1 JC isolate was the chimpanzee C455. HIV was easily 
isolated from the peripheral blood mononuclear cells (PBMC) of this animal. CocuUivation 
of PBMC derived from C455 with uninfected chimpanzee PBMC (cPBMC) resulted in the 
isolation of a virus (HTV-1 NC ) which induced syncytium formation in chimpanzee cells. The 
nucleotide sequence of this virus is distinguished from other known HIV-1 .solates. A 
substantially full-length infectious molecular clone has a nucleotide sequence as given m SEQ 
ID NO: 12. 



In a specific embodiment of the invention, probes of the invention comprise DNA 
sequences as set forth in SEQ ID NO:12 and/or sequences, of at least 15 contiguous 
nucleotides derived therefrom or sequences complementary thereto. 

Example 3. Vir "lr anrf Viral Antigens 

The HTV-1 isolates used to inoculate C499 included LAV (lymphadenopathy- 
associated virus>l BRU (Barre-Sinoussi [1983] Science 220:868-871), SF2 (previously 
designated ARV [AIDS-related virus]-2) (Levy et al. [1984] Science 225:840-842), and 
NDK a highly cytopathic HIV-1 of African origin (Spire et al. [1989] Gene 81:275-284). 
mentions of virus were done intravenously with 1 -ml aliquots of undiluted or diluted virus 
stocks, in January 1988, 33 months after inoculation HIV-1 SR , C499 was part of a study to 
assess the effects of therapeutic vaccination on immunity and viral status in fflV-l-infected 
chimpanzees. C499 was given two intramuscular injections, 4 weeks apart, of 500 ug of 
recombinant SF2 p53 gag, produced in yeast and formulated with 100 ug of muramyl 
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tripeptide (Ciba-Geigy, Summit, NJ) in 4% squalene and 0.008% Tween 80 (Fultz et al. 
[1989] in Vaccines 89, supra). 

The HIV-1 isolates used to inoculate C455 was JC, deposited with the AIDS Reagent 
Program, Catalog Number 3523. Inoculations of virus were done intravenously with 1-ml 
aliquots of undiluted or diluted virus stock. C534 was given two intramuscular injections, 4 
weeks apart, of 500 ug of recombinant SF2 p53 gag, produced in yeast and formulated with 
100 ug of muramyl tripeptide (Ciba-Geigy, Summit, NJ) in 4% squalene and 0.008% Tween 
80 (Fultz et al. [1989] in Vaccines 89, supra). 

Example 4. Serologic Assays 

Serum samples were tested by indirect ELISA for antibodies to specific HIV-1 
proteins using a series of recombinant antigens. These antigens have been described (Fultz et 
al. [1989] in Vaccines 89, supra) and included p25 gag and p53 gag, produced in yeast; p31 
pol; and e«v2-3(SF2) and e«v2-3 (IITB), nonglycosylated polypeptides corresponding to full- 
length gpl20. Antibody titers to whole-virus preparations were determined with an HTV 
enzyme immunoassay kit (EIA; Genetic Systems, Seattle, WA). Neutralization assays were 
performed as described (Fultz et al. [1986] Proc. Natl. Acad. Sci. 83:5286-5290); titers were 
based on >80% inhibition of reverse transcriptase (RT) activity after preincubation of virus 
with serum and infection of normal human PBMC. Antibodies cross-reactive with histone 
H2B were identified by immunoblot using purified calf thymus histones as described 
previously (Strickler et al. [1987] Nature 327:710-713). 

Example 5. Virus Assays 

Assays to detect virus in plasma samples or to recover virus from chimpanzee PBMC 
were performed with normal human PBMC as indicator cells, unless otherwise indicated, and 
have been described (Fultz et al. [1986] J. Virol. 58:1 16-124). Medium was RPMI 1640 with 
10% fetal bovine serum, 8 units of recombinant interleukin-2 (IL-2)/ml, glutamine, and 
antibiotics (RPMI-IL-2). Infectious PBMC were quantified by serial 1:5 limiting dilution and 
cocultivation with human PBMC (Fultz et al. [1986], supra). HIV-1 was detected in all assays 
by the presence of RT activity in cell-free culture supematants. To detect HIV antigen in 
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serum or plasma samples, a commercially available HIV-1 antigen capture kit (Coulter, 
Hialeah, FL) was used. 

To assess the influence of CD8* lymphocytes on viral replication and recovery, after 3 
days of stimulation with concanavalin A (ConA 10 ug/ml) in RPMI 1640, PBMC were 
washed once; 10 7 cells from C499 were placed in fresh RPMI-IL-2, cultured without indicator 
cells, and monitored for HIV-1 production. In other experiments, the overnight incubation of 
C499's PBMC to remove adherent cells, CD4 + - and CD8 + -enriched populations were obtained 
by panning with monoclonal antibodies specific for the CDS* antigen or with CD8*-coated 
magnetic beads (Dynabeads; Robbins Scientific, Mountain View, CA). After removal of the 
CD8-' 771 cells, CD8 + lymphocytes were recovered from the plates or magnetic beads by 
additional overnight incubation and repeated washes with medium. After stimulation with 
ConA, cultures were established either with CD4*-enriched cells only or with CD4*- and 
CD8 + -enriched cells in various ratios. Percentages of CD4* and CD8 + cells in enriched 
populations were determined by analysis with FACScan (Becton-Dickinson, Mountain View, 

CA). 

Replication kinetics in PBMC and macrophages were tested as follows. 1 x 10 7 
freshly isolated or Con-A stimulated PBMC from HIV-1 negative chimpanzee were infected 
overnight (at 37°C) with 20 ng of the indicated virus (p24 antigen concentration). The cells 
were centrifuged at 1000 rpm for 10 min, resuspended in 10 ml complete RPMI medium 
fining 10% FBS and IL-2 (IL-2 medium) and were incubated at 37°C. Samples of 
supematants (1 ml) were harvested on days 3, 7, 10, 14 and 17 post infection. IL-2 medium 
was added to the cultures following the sampling to maintain the original volume. 
Supematants were used in RT assays to determine the relative amounts of virus produced. 



For replication in macrophages, cPBMC were resuspended in macrophage media (6 x 
lOVwell in RPMI 1640 containing 15% human serum [AB*]. 1% HEPES, 0.008 ng/ml GM- 
CSF, 0.03 ng/ml M-CSF, 1% antibiotic-antimycotic solution (Sigma, St. Louis, MO) and 
seeded in a 24-well plate and incubated at 37°C for 4 hrs. The cells were mixed by pipetting 
up and down before the incubation was continued for 4 days. Non-adherent cells were 
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removed by gently washing the wells. Fresh medium (2 ml/well) was added and the cells 
cultured for an additional 3 days to allow full macrophage differentiation. Infections 
initiated by adding 10 ng of virus (p24) to the cells in 500 ul media and adsorbed 
might. The inoculum was removed and the cells were washed twice before fresh 
macrophage medium (2 ml) was added. On days 7 and 14 post infection, aliquots of 0.5 ml 
were taken for determination of p24 antigen levels using the HIV-1 p24 Antigen kit (Coulter 
Corp, Miami, FL) according to the manufacturer's instructions. 

Example 6. Cell-mediated Immune Responses 

Lymphocyte proliferative responses to mitogens were tested by incorporation of 
[>H]thymidine into the DNA of PBMC seeded in triplicate at 5 x 10< PBMC per well into 96- 
well plates. PBMC were incubated for four days with different concentrations of 
phytohemagglutinin (PHA) or ConA, pulsed overnight with 1 uCi of [ 3 H]thymidine per well, 
and harvested with a cell harvester (Skatron, Sterling, V A); counts per minute (cpm) 
incorporated were determined in a P counter. 

Cytotoxic T lymphocyte (CTL) responses against KV-1 em- and gag-encoded 
antigens were assayed using fresh PBMC from C499 as effector cells and autologous Epstein 
Barr virus-transformed B cells as target cells. Target cells were infected at MOI=10 with 
recombinant vaccinia viruses expressing HTV-1 gpl 20 env of P 24 gag or with wild-type 
vaccinia as control. Infected targets were labeled with 200 uCi of [* Cr] sodium chromate 
(Du Pont, Boston) for 2 hours and washed, and 10 5 target cells (T) were mixed with effector 
cells (E) at various ratios in 96-well plates. The plates ere incubated for 6 hours at 37°C in 
5% C0 3 , then 0.1 ml of medium was removed from each well and counted in a y counter. 
Maximum release (max) was determined by adding 1% Triton X to labeled target cells, and 
spontaneous release (spon), by adding only medium. Percentage of specific lysis was 
determined: [cpm(experimental) - cpm (spon)]/[cpm(max) - cpm(spon)] x 100. 

Example 7. Nnnhnman Prim ate Model Systems for AIDS 

Nonhuman primates infected with HIV-1 isolate JC or infected with HTV-1 NC are 
useful as a model system for the study of AIDS. Chimpanzees and other monkey species 
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used for this purpose are preferably specific pathogen-free animals, which are available from 
primate centers, e.g., the Yerkes Regional Primate Research Center, Emory University, 
Atlanta, GA. 

Infected primates are preferably maintained as a single colony of two or more animals, 
all inoculated with HTV-1 isolate JC or a colony of two or more animals, all inoculated with 
HTV-1 NC . A colony may be maintained in a single room with each primate housed in an 
appropriate cage, in accordance with standard practices for the maintenance of animals 
established by the Animal Welfare Act and the NIH guide for care and use of laboratory 
animals. 

The primates are infected with the HIV-1, C virus or the HIV-1 NC virus by any suitable 
means, such as intraperitoneal, intravenous or subcutaneous injection with a solution 
containing HIV-1 isolate JC or fflV-l NC . The solution may also be a body fluid or tissue 
(e.g., blood) from a previously infected primate, a blood fraction containing peripheral blood 
mononuclear cells from a previously infected primate, a pharmaceutical^ acceptable carrier 
such as saline solution containing fflV-1 isolate JC or HIV-1 isolate NC, etc. 

Nonhuman primates infected with HTV- JC or HTV-1 NC are particularly useful as a 
model system for AIDS because of the concomitant decrease in CD4 + cells and increase in 
HTV-1 loads in plasma. The development of AIDS has not been previously documented in 
any nonhuman primate species. When used as a model system, a primate(s) infected with 
HTV-ljc or HIV-1 NC virus is subjected to a treatment useful in combating AIDS in humans 
and thereafter the progress of the infection and related diseases is monitored. A control 
(placebo) group of HTV-1 JC or HTV-1 NC -infected animals is left untreated for comparative 
purposes. A slowing in the progression of the development of AIDS in infected animals 
indicates that the treatment may be useful for combating AIDS in humans and additional 
screening and topological testing is prescribed. Such treatment includes but is not limited 
to a vaccine, a drug (e.g., an antiretroviral compound) or a drug combination (e.g., antiviral 
nucleosides such as AZT, DDI, etc.), a peptide, a protein, etc.) or a vaccine/drug combination, 
etc. After treatment initiation, the progress of the disease is monitored by any suitable 
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parameters including, but not limited to, (a) decline in CD4* cell levels, (b) increase in viral 
loads in plasma, (c) presence of fflV-l JC virus or HIV-1 NC virus, (d) weight loss, (e) general 
appearance, and other symptoms characteristic of AIDS. 

Example 8. Virus Cloning and Sequence Analysis 

cPBMC infected with either HIV-1 JC or HIV-1 NC were used for isolation of DNA 
using the Puregene Kit (Centra systems, Minneapolis, MM) as directed by the manufacturer. 
The strategy used for PCR amplification and primer location on the fflV-1 viral genome is 
shown in Figure 3. PCR primers MSF12 (S'-AAA TCT CTA GCA GTG GCG CCC GAA 
CAG-3', (SEQ ID NO:18); HIV-1 LAV nt 169 to 195) and MSR5 (5'-GCA CTC AAG GCA 
AGC TTT ATT GAG GCT-3', (SEQ ID NO:19); HIV-1 LAV nt 9225 to 9198) [Salminen et al. 
(1995) Virology 213, 80-86] were used to amplify a 9056 bp product from PBMC genomic 
DNA prepared from fflV-1 JC -infected cells. Another PCR primer pair 527 (5'-CAC ACA 
CAA GGC TAC TTC CCT GAT TGG CAG A-3\ SEQ ID NO:20, HTV-W nt 5302 to 
5274) was used to amplify 5' LTR-containing fragments (5699 bp) from the same DNA 
source For the HIV-1 NC viral genome, PCRprimer pairs 527-528 and 529 (5VATG GAA 
CAA GCC CCA GAA GAC CAA GGG CCA CAW. SEQ ID NO:21, HIV- W nt 5141 to 
5173) and 530 (S'-GOT CTG AGO GAT CTC TAG TTA CCA GAG TCA C-3\ SEQ ID 
NO-22, HIV-l LA v nt 151 to 121) were used to generate the 5'-half (5699 bp) and 3'-half (4142 
bp) PCR products respectively from HIV-1 NC genomic PBMC DNA. Primers were 
synthesized on an Applied Biosystems 392 DNA synthesizer (Applied Biosystems, Foster 
City, CA). Briefly, PCR was performed using the reagents from the Expand Long Template 
kit (Boehringer Mannheim, Indianapolis, IN) and 200 ng of DNA template, according to the 
manufacturer's instructions. After an initial DNA denaturation of 94°C for 2 min, the PCR 
consisted of 10 cycles of 94°C for 15s, 61X for 30s, 68°C for 8 min followed by 20 cycles 
of 94°C for 15s, 61 °C for 30s, and 68°C for 8 min with a 5 second addition to each 
extension. The samples were incubated at 72X for 30 min after the last cycle and then 
cooled to4°C. Results ofPCR reactions were evaluated on 0.9% agarose gels. PCR 
products representing the correct sized fragments were isolated from agarose and were 
directly cloned into the pCR II plasmid and amplified in Escherichia coli bactena (TA 
cloning kit, Invitrogen Corp., San Diego, CA) according to the manufacturer's protocol. 
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Single bacterial colonies containing plasmids with inserts of the correct size were grown at 
30°C overnight and plasmid DNA prepared by the alkaline lysis method. 

The strategy for preparing full-length molecular clones is illustrated in Figure 3. 
Several restriction enzymes were used to generate restriction maps for the positive clones. 
The Apa I fragment (1947 bp) from the 5' half PCR product was gel-purified and subcloned 
into the large fragment (7675 bp) of P JC using standard cloning procedures [Sambrook et al. 
(1989) Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory Press, 
Cold Spring Harbor, NY] to generate plasmid pHIV-1 JC16 (Fig. 1). A chimeric plasmid 
(pHTV-W was generated by subcloning the PCR amplified 5' half of HIV-1 NC into pHIV- 
1 t For HIV-1 NC full-length clone, the Nco l-Xho I env-containing fragment from the 3' half 
PCR product was gel purified and subcloned into the Nco l-Xho I large fragment of plasmid 
pHIV-W containing the 5' half of HIV-1 NC . Multiple restriction enzymes were used for 
analysis of both viral DNAs to confirm the full length clones. 

5 x 10 6 CEMxl74 cells in T-25 flasks were transfected with 2 ug of either P HIV-1 JC16 
„ DNA in transfection buffer (25 mM Tris-HCl)(pH 7.5, 140 mM NaCl, 5 mM Kcl, 0.7 
"Z K 2 HP0 4 3H 2 0) containing 4 ul of DEAE-dextran (60 mg/ml) for 20 minutes at room 
temperature. Five milliliters of complete medium (RPM 1640 supplemented with 10% fetal 
bovine serum (FBS) and 2 mM L-glutamine) was added to stop the reaction followed by 
centrifugation of cells at 1000 rpm for 10 minutes. The cells were washed twice in 10 ml 
complete medium before they were .suspended in 10 ml complete medium, transferred to a 
T-25 cm' flask and incubated at 37°C (5% CO,). The cells were checked daily for cvtopathtc 
effects (syncytia formation) and aliquots of cultures were tested for the presence of reverse 
transcriptase (RT) activity using standard assay methods. For 293 cell lines, 2x10' cells m 
6-well plates were transfected with 2 ug of viral DNA using lipofectin (Life Technology 
Gaithersburg, MD) or DOTAP (Boehringer Mannheim, Indianapolis, IN) according to the 
nWacturers' instructions. After 24 hours, the transfected cells were overlaid w,th 2 x 
lOVwell of uninfected cPBMC previously stimulated with concanavalin A (Con-A) for 4 
days After an additional 2 day incubation, the non-adherent cell population (cPBMC) were 
transferred to a T-25 flask and additional stimulated cPBMC added for virus amplified 
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Culture supematants were assayed for RT activity and the cells were observed daily for 
development of syncytia. Cell-free stocks of molecularly cloned viruses were prepared at 
peak RT activity, aliquoted and stored under liquid nitrogen. 

Primers for sequencing were constructed from conserved regions of aligned sequences 
of HTV-1 LAI and fflV-l„ and were synthesized on an applied Biosystems 392 DNA 
synthesize, The DNA sequence of each full-length clone virus was determined by the 
dideoxy-chain termination method using the sequenase system (Amersham Life Sctences, 
Arlington Heights, IL) and »S-dATP. Nucleotide sequence alignments were performed w.th 
the Intelligenetics Suite of programs (Intelligenetics, Beaverton, OR) whild the phylogenetK 
analysis of amino acid sequence was done with CLUSTAL method (DNASTAR, Inc., 
Madison, WI). 

Example 9. Heteroduplex Mobility Assays. 

The nucleotide sequences for fflV-1 „. and HIV-1 NC7 have been assigned Genbank 
accession numbers AF049494 and AF049495 respectively. 

The heteroduplex mobility assay kit (NIH AIDS Research and Reference Reagent 
Program) based on the method described by Delwart et al. (1993) Science 262, 1257-1261 
was used. Briefly, equal amounts (5 ul each) of second-round PGR products (V1-V2 and V3- 
V5) from infected cPBMC genomic DNA were mixed with the reference PGR products to 
obtainheteroduplexes. After adding U ul of 1 Ox annealing buffer (1 MNaCl, lOOmMTns 
[pH 7 8], 20 mM EDTA), the mixed DNAs were denatured at 94<C for 2 min and then 
Lnealed by rapidly coolingmice. Three ul of loading dye (25% Ficoll, 1% Orange G) was 
added to the cooled DNA mixture and the samples loaded onto 5"/. polyacrylamide ge Un Ix 
TBE(88mMTns-bo ra te,89rnMbo* 

constant voltage of 250 V for 2.5 hrs. The gels were stamed with ethidium bromide and 
visualized under ultra-violet (UV) light. 
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TABLE 1 

Platelet counts, lymphocyte subset levels, and plasma HIV-1 loads in C499 




■ oocyte subset counts in peripheral b.ood were determined by FACScan anaiysis as described previous* (Ahmed- 
« ND°not doneTno P'« ma sam P |e was availab,e for mis da,e) - 
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T,b,= 2A. Al*-. rfWV-.c (SEQ ID NO:2, and BJ-kg £»£ NM > P "* 

Sequences. The consensus sequence corresponds to SEQ ID NU.o. 

H...B, ^,M,L<!m«-CSl-E-L»VTVYYOTP-«KE-TTTI^CMMK»V 

consensus MR--fcn 

HIVNDKENV « kk Ea!ULl^^ 

consensus --E-HN-WATHACVPTDPNPQEI-L-^E-P^-MVEQMH-DIISLWD-SLKPCVKL 

HIVJCENV »3 ™*™^^ 

consensus TPLCVTLNCTDlknEt--NSs--N Eimg-E- -KNCSFNV Y-Lf — 

HIVJCENV 194 "™ 

HIVNDKENV !74 pi JiLt^^ 

consensus . Iv .-- N .-N S T---L--C-TS--TO A CPK-S FE PXPIH-C MAG P A ILKC-DKKFN GTO P 

HIVJCENV 245 CnNVSTVQCTHGIRPWSTQLLI^GSvAEEEw IRSaNf sdNaKTIIVQI^SveltCTRP 
consensus C-NVSTVQCTHGIEPWSTQU.LNaS-AEEE-RS-N-N-KTIIVQI^-S-I-CTRP 

HXVJCBNV 30* nynetUcirlnr^rs^ ^^^'T^TT^^ 

HIVNDKENV 2 *S v^-U^iU^^ 

x Tgk--K-G QAHC R--W--AL-Q-A-KL NKT-I 

consensus x 

HIVJCENV 365 

consensus -P-SSGGO-EI-H-NCGG-^CNT-LPNSTWN-T T L PCRI-0-VN- 

HIVNDKENV «7 J^iS aNNs shETiRPGgGDMRDNWRSELYK 

consensus WQ-VGKAMYAPP- -G-I -C-SNITGLLLTRDGGhn-NN ET-RPG-GDMRDNWRSELYK 
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Table 2 A. (Continued) 

HIVJCENV „ •^ffSffS 
HIVNDKENV ,* 

hivjcenv m <™^™™r^nXrt™u™™™t 

HIVJCENV .,. --7--^^^ 
HIVHDKENV ... iJJJW JJlrlU^IA^^ 

co».«» S .M.WF.L.-LWYIK.FMIV-OL.^IVFAVLS.VM^YSPLSFvT-.-PRGPD 

hivjcenv m 

HIVHDKENV , U ILIUM^^ 

co»»». , p ..ie-e<k ! e R - E i.hs- K .v. 0 -.m..»- D ^ S ^--«--»««^ 

hivjcenv ,« «^™-™™^^^ 

II 111 1 IHMIIII Ml I [J 'U^LiivVOMCMILnvPERIHW 

HIVN0KEKV 780 «EU^»r™™SOELrNSM=LldTlAIWMrTl!!!WEVTO 

„„.^ S »«™-LLC^-«S»--L--T-»l-V«-™-^-^--'^ 

HIVJCENV 853 LERiLL 

Ml II 

HIVNDKENV 841 LERILL 



consensus LER-LL 



Alignment 



score = 541.00 
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Table 2B. Ahgnment of HIV- 1;C and HIV- 1 LAJ env Protein Sequences ; 

SEQ ID NO:4, respectively). The consensus sequence corresponds to fcty ID rsu. / . 

— x _ _w f^~~^ — ^ 
HIVlaienvt X liliLy!^^ 
consensus MRVKE- -QHLWKWGWKWG-MLLG-^ICSATEKLWVTVYYOVPVWKE-TTTLFCASDAKAY 



HIVJCENV 62 ee 



EVmTVWATHACVPTDPNPQEiVLaNVTEdF^ 

1 1 1 1 ! : 1 1 1 1! 1 1 1 1 i ; : II Mil Mil M i 1 1 1 1 IIMIMI 

HXVlaienvt S2 atEV^HACVP^^ 
consensus - - EVHNVWATHACVPTDPNPQE - VL -NVTE - FNMWKN - MVEQMH - D I ISLWD - SLKPCVKL 

HIVlaienvt 123 UicVsLcliUa^ 
consensus T p L CV-L-CTDL-N-T-™SS--NS-SGE-Mm---EIK N CSF N - S T---GKV Q K E Y--FY- 

HIVlaienvt X84 iil U^tUytlii^ViTQACPKVSFEPIPXHVCAPAGFAILKC^tFNGTG 
consensus LDIvs I N - - TS - - LTSCNTSV- TQACPKVSFEPIP IHYCAPAGFAILKCN- K- FKGTG 

HIVJCENV 244 

HIVlaienvt 243 ILtLlLlvQ^ 
consensus PC - NVSTVQCTHGIRPVVSTQIjLLNGS - AEEEVV- RSANF -DNAKTI IVQLN - SVEI - CTR 

HOT 30S ~™ 

HIVlaienvt 304 ^IrL^*^ 

PN-N--K- IRI -RG-GR-FVT- -K-G- -QAHC- --R-KW- LKQIASKLREQFgN- - - 1 



consensus 



HIVJCENV 3 6 s „™ - « SN enlT.PCRI 

HIVlaienvt 3S5 i^iwvtJS 

consensus IF- SSGGD-EI - -HSFNCGGE- FYCN- T - LFNSTWf N- TwsTEgSNn- -Gs - ITLPCRI 

HIVJCENV 420 ~~ " 
HIVlaienvt 426 k JlJJJ^^^ « NNngsEiFRPGgGDMRDNW 

-QF-NMWQ-VGKAMYAPP-GQIRC-SNITGLLLTRDGGhNdNN- - -E-FRPG-GDMRDNW 



consensus 
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HXVla.envt 485 iili!!^ iGAl FLGFLGAAGSTMGArSmTLTV 

consensus RSELYKYTCV-KIEPLGVAPTKAKIUiVVQREKRAVGm-GA-FLGFljGAAGSTMGA-S-TLTV 

HXVlaienv. 345 ^iiiiii^ 
consensus QARQLLSGIVQQQNNLLRAIEAQQHLLQLTVWGIKQLQAR-LAVERYLKDQQLLGIWGCSG 

HIVJCEN V 603 

HXVXaienvt 606 ^aVPV^ 
consensus KLICTT-VPVOlASWSNKSL-QIWNKMTW-EVJDREI-NYT-LIH-LIEESQNQQBKNEQBIiL 

HXVXaienvt 667 si^L^ 
consensus E^KWASLW-WF-I-^LWY^IFIMIV-GLVGLRIVFAVLSIVNRVRQGVSPLSFQTH-P 

HIVJCENV 7 2 5 
HXVXaienvC 7 2 8 ^rUILe^ 

consensus -prgpdrp-gie-eggerdrdrs-rlv-g-l^-w-du.-lclfsvhrlrdlllivtrive 

HIVJCENV ,36 -~rr n" 
HXVlaienvt 789 11^,11^^ 
consensus ^GV^^-W-IXQYWSQEL^ 

HIVJCENV 847 RRlRQGLERiLL 

miinii m 

HIVlaienvt 850 RRlRQGLERiLL 
consensus RR I RQGLE R - LL 

Alignment score « 657.00 
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Table 2C. Alignment of H1V-1 JC and HIV-1 SF2 env Protein Sequences (SEQ ID KO:2 *nd 
SEQ ID NO:5, respectively). The consensus sequence corresponds to bbQ 1U NU.5. 

kivjcemv , ^f^n n~m n c ™~ 

HIVsf2envt 1 MkVK gt r rnyqhl WrWGt 1 LLGMLMI cSATEKLWVTVYYGVPVWKEaTTTLFCASDAr AY 

consensus M-VKe- - -W-WG- -LLGMMICSATEKLWVTVYYGVPVVnCE-TTTLFCASDA-AY 

HIVJCENV 62 eeEVHNVWATHACVPTDPNPQE iVLaNVTEdFOTWKNeMVEQMhtDI I SLVTOeSLKPCVKL 

hivjcenv iiiiiiniiiiiiiiiiii ii mi nun mil i m m niiiim 

Hivsf 2 envt 61 dtl^AracvPTOPTO 

consensus - - EVHNVWATHACVPTDPNPQE -VL-NVTE- FNMWKN-MVEQM- -DIISLWD- SLKPCVKL 

HIVJCENV 123 TPLCVTLNCTDLkneTkTNSSdaNsnsgeimgneeikncsfnvstgapgKvqkeyalfyal 

HIVsf2envt 122 llLLvlliiiLLglcaLllii NwkeeikgeikncsfnittsirdkiqKenalf rnidw 



consensus TPLCVTLNCTDL T-TNSSdaN K 

HIVJCENV 134 -3-™™^ 
HIVsf2envt 181 pldnastttNyTnyrLihCNrSViTQACPKVSFEPIPIHYCtPAGFAILKCNnKtFNGkGP 

consensus -I N-T L--CN-SV- TQACPKVS FEPI PIHYC - PAGFAILKCN- K- FNG-GP 

HIVJCENV 245 CnNVSTVQCTHGIRPvVSTQLLLNGSvAEEEWlRS 

I I I I I I I I I I II I I II I II I I I I I I I I II II I INI 

HIVsf2env t 242 ctNVSWQCraGIRPiVSTQLLLNGSlAEE 

consensus C -NVSTVQCTHGIRP - VSTQLLLNGS - AEEEW- RS -NF - -NAKTI IVQLN- SV- I - CTRP 

HIVJCENV 306 f^*™^^ 
Hivsf2envt 303 UtrisiyigpG ra fhtTgRiiGDirkAHCni S Ra q WnNtLeQIvkKLRE Q F g NnktIv 



consensus 



N-N-K-I-I-Gy T-R--GD---AHC— R--W-N-L-QI- -KLREQFgN— I- 



HIVJCENV 366 FNrSSGGDIEIeMHSFNCgGElFYCNTTkLFNsTW netTEsngkgenltLPCRIrQfvNM 

1 1 1 1 1 1 1 i ii 1 1 1 1 1 1 1 1 1 1 m i ii i mil i ii 

H I Vsf2envt 363 FNqSSGGDpEIvMHSFNCrGEf FYC^ 
consensus 



FN-SSGGD-EI-MHSFNC-GE-FYCNTT-LFN-TWr---TE I-LPCRI-Q--NM 



HIVJCENV 426 WQkVGKAMYAPPsdGQIrCtSNITGLLLTRDGGhNdnNtnnEtFRPGrGDlTO 
HIVsf2env t 424 lUi^illigioLcsiNiiGLLL^tNvtN dtEvFRPGgGDMRDNWRS ELYK 
consensus „Q.VGKAMYAPP--GQI-C-SNITGLU.TRDGG-N--N t --E-FRPG-G D MRD N WRSELYK 
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Table 2C. (Continued) 

HI V3f2envt 484 YKVIKIEPLGiAPTKAKRRVVQREK^ 
consensus YKVIKIEPLG-APTKAKRRWQREKRAVG-VGAMFLGFLGAAGSTMGA- S LTLTVQ ARQLL 



HIVJCENV 548 SGIVQQQNNLLRAIEAQQHLLQLTVWGIKQLQARVIAyERYL^ 

HIVJCENV I 111 I I I I I I I I I I I | I I I I I I I I I I I I I I M I I I I I I I M I I M I I I I jjj. 

HIVsf.envt 545 iiiviii^ 
consensus S GI VQQQNNLLRAI EAQQHLLQL.TVWG I KQLQARVLAVERYL - DQQLLG I WGCSGKLI CTT 

~ SO, — — ™— 

Hivsf2envt 606 aV^wi*^ 
consensus -VPVR l ASW S NKSL-- I W- NW rW--W-REI-NYTN-I--L-EES QN Q Q E laJ E Q ELL E U)KWA 



HIVJCENV 67 0 



SLWsWFdlsNWLWYIKIFIMIVaGLVGLRIVFAV^I^^ 

ill M | 1 1 1 1 1 1 1 1 1 1 1 1 1 IIMIIIII IIIIIHIIII 1 1,1,1 

HIVsf2envt 667 Iii n iLi t iii4IriiiiiiU g iivG^ 

SLW-WF-I-NMLWYIKIFIMIV-GLVGLRIVFAVLSIVNRVRQGYSPLSFQT-P-PRGPD 



consensus 



HIVJCENV 731 RPDGIEgEGGERDRDRSVRLVIXSFLALlWEDLRnLCLFSYhR^ 

1 1 1 ii i 1 1 1 1 1 1 1 1 1 j 1 1 Ml 1 1 II M 1 1 i II 11 11 

HIVsf 2 envt 7 28 ilia^ER^^ 
consensus RP DGIE-EGGERDRDRSVRLVDGFLAI,-WEDLR-LCLFSY-RLRDLLLI-R-VE-LG-RG 

HIVJCENV 792 WEALKYlWSLLQYWsQELKNSAVnl f NtTAI vVaEGTDRi IEVvQRl cRAILHIpRRIRQG 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MUM II 1 1 1 1 1 1 M 1 1 

HXVsf 2 envt 7 89 WEALKYwWSLLQYW^ 
consensus WEALKY - WSLLQYW -QELKNSAV N - TAI - V - EGTDR -IEV-QR-- RAILHI - RRIRQG 

HIVJCENV 853 LERiLL 
III II 

HIVsf2envt 850 LERILL 
consensus LER-LL 
Alignment score « 614.00 



53 



SUBSTITUTE SHEET (RULE 26) 



WO 98/59074 

Table 3. HIV-1 Antibody Response of C534 After Transfusion From C455 



PCT/US98/12990 



Days Post Infection 


Titer 


0 


0 


7 


0 


14 


400 


21 


1600 


28 


1600 
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Table 4. Comparative Analyses of Chimpanzee HIV-1 Isolates With Inoculating Viruses 



1 HIV-l JCI6 vs: 


Mutations 
unique to 
JC/NC 2 


1 LAI 


SF2 


NDK 


NC7 


Pol % homology 1 
insertions 
deletions 

point mutations | 


96.6 
12 
0 

30 


94.8 
0 
0 
51 


94.0 
0 

1 

56 


98.8 
0 
0 
11 


0 
0 
23 


Gag % homology 1 
insertions 
deletions 
point mutations 


94.2 
12 
0 

25 


92.2 
5 
3 

33 


88.8 
I 

81 
42 


97.4 
2 
0 
9 


0 
2 

20 


Vif % homology 1 
insertions 
deletions 

point mutations 1 


92.2 
0 
0 
14 


86.0 
0 
0 

26 


86.5 
0 
0 

25 


96.9 
0 
0 
5 


0 
0 
10 


Vpr % homology 1 
insertions 
deletions 

point mutations 1 


88.7 
0 
1 

7 


96.9 
0 
0 
2 


89.7 
0 
1 
6 


96.6 
0 
0 
2 


0 
0 
0 


Tat % homology 1 
insertions 
deletions 

point mutations 1 


88.4 
0 
15 
10 


82.5 
1 
0 
18 


74.4 
0 
15 
22 


97.0 
0 
0 
3 


0 
0 
4 


Nef % homology 1 
insertions 
deletions 

point mutations | 


87.4 
0 
0 
25 


89.4 
4 
0 
18 


78.7 
1 
0 

40 


99.0 
0 
0 
1 


0 
0 
8 


Rev % homology 1 
insertions 
deletions 

point mutations | 


84.5 
1 
I 

18 


82.9 
1 
0 
19 


75.2 
1 

0 

28 


96.6 
0 
0 
4 


0 
0 
11 


Env % homology 1 
insertions 
deletions 

point mutations | 


81.9 
9 
7 

138 


76.1 
9 
13 

173 


71.4 
11 
24 
203 


97.6 

0 

i 
i 

17 


0 
2 

78 


Vpu % homology 1 
insertions 
deletions 
point mutations 


79.3 
0 
1 

13 


63.9 
0 
1 

24 


67.1 
0 
1 

24 


98.8 
0 
0 
0 


0 
0 
10 


LTR % homology 1 
insertions 
deletions 
point mutations 


92.1 
1 
1 

48 


93.9 
3 
3 

29 


89.1 
1 
1 

52 


98.7 
0 
0 
10 


0 
1 

14 



■Amino acid homology 

'Amino acids or nucleotides not observed in parental viruses 
Nucleotide identity 
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(1) GENERAL INFORMATION: 

(i) APPLICANT: EMORY UNIVERSITY 

(ii) TITLE OF INVENTION: Human Immunodeficiency Viruses 
Causing AIDS in a Nonhuman Primate 



(iii) NUMBER OF SEQUENCES: 23 



(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Greenlee, Winner and Sullivan, 

(B) STREET: 5370 Manhattan Circle, Suite 201 

(C) CITY: Boulder 

(D) STATE: Colorado 

(E) COUNTRY: US 

(F) ZIP : 80303 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC -DOS /MS -DOS 

(D) SOFTWARE: Patentln Release #1.0, Version 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: WO 

(B) FILING DATE: 23-JUN-1998 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 60/050,548 

(B) FILING DATE: 23-JUN-1997 

(Vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US SO/057,606 

(B) FILING DATE: 04-SEP-1997 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Ferber M. , Donna 

(B) REGISTRATION NUMBER: 33,878 

(C) REFERENCE / DOCKET NUMBER: 66-97 WO 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (303) 499-8080 

(B) TELEFAX: (303) 499-8089 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2577 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: not relevant 
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(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1..2577 



( X i) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

ATG AGA GTG AAG GAG AAC TGT CAG CAC TTG TGG AGA TGG GGG TGG AAA 
Zl Arg Val Lys Glu Asn Cys Gin His Leu Trp Arg Trp Gly Trp Lys 



5 10 



TGG GGC ATC ATG CTC CTT GGG ATG TTA ATG ATC TGT AGT GCT ACA GAA 
S GlY lie Met Leu Leu Gly Met Leu Met lie Cys Ser Ala Thr Glu 
20 25 30 

AAA TTG TGG GTC ACA GTC TAT TAT GGG GTA CCT GTG TGG AAG GAA ACA 
I£ leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Thr 

40 4S 



35 



ACT ACC ACT CTA TTT TGT GCA TCA GAT GCT AAA GCA TAT GAA GAA GAG 
J£ Thr t£ Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Glu Glu Glu 
50 55 60 

GTA CAT AAT GTT TGG GCC ACA CAT GCC TGT GTA CCC ACA GAC CCC AAC 
III Tis Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn 



65 



70 



CCA CAA GAA ATA GTA TTG GCA AAT GTG ACA GAA GAT TTT AAC ATG TGG 
Pro Gin Glu lie Val Leu Ala Asn Val Thr Glu Asp Phe Asn Met Trp 



85 90 « 



AAA AAT GAA ATG GTA GAA CAG ATG CAT ACT GAT ATA ATC AGT TTA TGG 
Lys Tsn Glu Met Val Glu Gin Met His Thr Asp lie He Ser Leu Trp 
100 "5 "° 

GAT GAA AGC CTA AAA CCA TGT GTA AAA TTA ACC CCA CTC TGT GTT ACT 
Asp Glu Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu cys Val Thr 
115 120 125 

TTA AAT TGC ACT GAT TTG AAG AAT GAA ACT AAG ACC AAT AGT AGT GAT 
lIu "n Cys Thr Asp Leu Lys Asn Glu Thr Lys Thr Asn Ser Ser Asp 
130 "5 1" 

GCC AAT AGT AAT AGC GGG GAA ATA ATG GGG AAC GAA GAG ATA AAA AAT 
Ma £n Ser Asn Ser Gly Glu He Met Gly Asn Glu Glu lie Lys Asn 
145 ISO 155 

TGC TCT TTC AAT GTC AGC ACA GGC GCA CCA GGT AAG GTG CAG AAA GAA 
Zs S ^e Asn Val Ser Thr Gly Ala Pro Gly Lys Val Gin Lys Glu 

165 "0 175 



48 



96 



144 



192 



240 



288 



336 



384 



432 



480 



528 
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TAT TCA CTT TTT TAT GCA CTT GAT ATA GTA TCA ATA AAG AAT GAA AAT 
™ IS Leu Phe Tyr Ala Leu Asp lie Val Ser lie Lys Asn Glu Asn 

1 185 190 



180 



AAT AGT ACC AGC CAT ATG TTG ACA AGT TGT AAC ACC TCA GTC AGT ACA 
£1 ser Thr Ser His Met Leu Thr Ser Cys Asn Thr Ser Val Ser Thr 



195 200 



™r rrr TGT CCA AAG GTA TCC TTT GAG CCA ATT CCC ATA CAT TAT TGT 
ofn S Z - £ Val Ser Phe Glu Pro He Pro lie His Tyr Cys 
210 215 220 

GCC CCG GCT GGT TTT GCA ATT CTA AAA TGT AAT GAT AAG AAG TTC AAT 
S S SI Gly Phe Ala lie Leu Lys Cys Asn Asp Lys Lys Phe Asn 

225 230 " 5 

~» »r» rra CCA TGT AAC AAT GTC AGC ACA GTA CAA TGT ACA CAT GGA 
Ty tS Gly Z 17s Z Asn Val Ser Thr Val Gin Cys Thr His Gly 
Y 245 250 255 

ATT AGA CCA GTA GTG TCA ACT CAA CTG CTG TTA AAT GGC AGT GTA GCA 
lie Arg Pro Val Val Ser Thr Gin Leu Leu Leu Asn Gly Ser Val Ala 



260 



265 



GAA GAA GAG GTA GTA CTT AGA TCT GCC AAT TTC TCA GAC AAT GCT AAA 
gXu Glu S! Val Val Leu Arg Ser Ala Asn Phe Ser Asp Asn Ala Lys 

280 285 



275 



ACC ATA ATA GTA CAG CTG AAC CAC TCT GTA GAA ATT ACT TGT ACA AGA 
Thr Ue Te Val Gin Leu Asn His Ser Val Glu lie Thr Cys Thr Arg 
290 



295 300 



CCC AAC TAC AAT GAA ACA AAG AAA ATC CGT ATC CAC AGA GGA TAT GGA 
Pro ™ Tyr Asn Glu Thr Lys Lys He Arg lie His Arg Gly Tyr Gly 



305 310 



AGA TCA TTT GTT ACA GTA AGA AAA TTG GGA GAT AGG AAA CAA GCA CAT 
£ Ser Pnl v" Thr Val Arg Lys Leu Gly Asp Arg Lys Gin A a Hx. 

3 330 33b 



325 



TGT ACC ATG AAT AGA ACG AAA TGG GAC AAC GCT TTA AAA CAG ATA GCT 
lys T"hr Met Asn Arg Thr Lys Trp Asp Asn Ala Leu Lys Gin lie Ala 



340 



345 



AGC AAA TTA AGA GAA CAA TTT AAT AAA ACA GCA ATA ATC TTT AAC CGG 
S« i£ Zu S Glu Gin Phe Asn Lys Thr Ala lie lie Phe Asn Arg 

360 365 



355 



TCC TCA GGA GGG GAC CTA GAA ATT GAA ATG CAC AGT TTT AAT TGC GGA 
Ser ITr Gly Gly Asp Leu Glu lie Glu Met His Ser Phe Asn Cys Gly 
370 



375 3 8° 



GGG GAA TTG TTC TAC TGT AAT ACA ACA AAA CTG TTT AAT AGT ACT TGG 
Ty £u Zt Phe Tyr Cys Asn Thr Thr Lys Leu Phe Asn Ser Thr Trp 



576 



624 



672 



720 



768 



816 



864 



912 



960 



1008 



1056 



1104 



1152 



1200 



385 390 
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AAT GAG ACT ACA GAG TCA AAT GGC AAG GGA GAA AAT ATC ACA CTC CCA 
Asn Glu Thr Thr Glu Ser Asn Gly Lys Gly Glu Asn lie Thr Leu Pro 
405 410 415 



1248 



TGC AGA ATA AGA CAA TTT GTA AAC ATG TGG CAG AAA GTA GGA AAA GCA 
Cvs Arq He Arg Gin Phe Val Asn Met Trp Gin Lys Val Gly Lys Ala 
7 425 430 



420 



ATG TAT GCC CCT CCC AGC GAT GGA CAA ATT AGG TGT ACA TCA AAT ATT 
Met Tyr Ala Pro Pro Ser Asp Gly Gin lie Arg Cys Thr Ser Asn He 
435 440 445 

ACT GGG CTA CTA TTA ACA AGA GAT GGG GGT CAT AAT GAT AAC AAC ACT 
Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly His Asn Asp Asn Asn Thr 
450 4 *5 460 

AAC AAC GAG ACC TTC AGA CCG GGA AGA GGA GAT ATG AGG GAC AAT TGG 
Asn Asn Glu Thr Phe Arg Pro Gly Arg Gly Asp Met Arg Asp Asn Trp 

Anf\ 475 480 

465 470 

AGA AGT GAA TTA TAT AAA TAT AAA GTA ATA AAA ATT GAA CCA TTA GGA 
Arg Ser Glu Leu Tyr Lys Tyr Lys Val lie Lys He Glu Pro Leu Gly 
485 490 495 

GTA GCA CCC ACC AAG GCA AAG AGA AGA GTG GTG CAG AGA GAA AAA AGA 
Val Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gin Arg Glu Lys Arg 
500 505 510 

GCA GTG GGA ATG GTA GGA GCT ATG TTC CTT GGG TTC TTG GGA GCA GCA 
Ala Val Gly Met Val Gly Ala Met Phe Leu Gly Phe Leu Gly Ala Ala 
515 520 525 

GGA AGC ACT ATG GGC GCA GCG TCA TTG ACQ CTG ACG GTA CAG GCC AGA 
Gly Ser Thr Met Gly Ala Ala Ser Leu Thr Leu Thr Val Gin Ala Arg 
530 535 540 

CAA TTA TTG TCT GGT ATA GTG CAG CAG CAG AAC AAT CTG CTG AGA GCT 
Gin Leu Leu Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg Ala 
545 550 555 560 

ATT GAG GCG CAA CAA CAT CTG TTG CAA CTC ACA GTC TGG GGC ATC AAG 
He Glu Ala Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly He Lys 
565 570 575 

CAG CTC CAG GCA AGA GTC CTG GCT GTA GAA AGA TAC CTA AAG GAT CAA 
Gin Leu Gin Ala Arg Val Leu Ala Val Glu Arg Tyr Leu Lys Asp Gin 

585 590 



580 



CAG CTC CTG GGG ATC TGG GGT TGC TCT GGA AAA CTC ATT TGC ACC ACT 
Gin Leu Leu Gly He Trp Gly Cys Ser Gly Lys Leu He Cys Thr Thr 
595 600 605 

ACT GTG CCT TGG AAT GCT AGT TGG AGT AAT AAA TCT TTG GAT CAG ATT 
Thr Val Pro Trp Asn Ala Ser Trp Ser Asn Lys Ser Leu Asp Gin He 
610 615 620 



1296 



1344 



1392 



1440 



1488 



1536 



1584 



1632 



1680 



1728 



1776 



1824 



1872 
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TGG AAT AAC ATG ACC TGG TTG GAG TGG GAC AGA GAA ATT GCC AAT TAC 1920 
Trp Asn Asn Met Thr Trp Leu Glu Trp Asp Arg Glu He Ala Asn Tyr 
625 630 635 640 

ACA AAC TTA ATA CAT CAC TTA ATT GAA GAA TCG CAA AAC CAG CAA GAA 1968 
Thr Asn Leu He His His Leu He Glu Glu Ser Gin Asn Gin Gin Glu 
645 650 655 

AAG AAT GAA CAA GAA TTA TTG GAA TTA GAT AAA TGG GCA AGT TTG TGG 2016 
Lys Asn Glu Gin Glu Leu Leu Glu Leu Asp Lys Trp Ala Ser Leu Trp 
660 665 670 

AGT TGG TTT GAC ATA TCA AAC TGG CTG TGG TAT ATA AAA ATA TTC ATA 2064 
Ser Trp Phe Asp He Ser Asn Trp Leu Trp Tyr lie Lys lie Phe lie 
675 680 685 

ATG ATA GTA GCA GGC TTA GTA GGT TTA AGA ATA GTT TTT GCT GTG CTT 2112 
Met lie Val Ala Gly Leu Val Gly Leu Arg He Val Phe Ala Val Leu 
690 695 700 

TCT ATA GTA AAT AGA GTT AGG CAG GGA TAC TCA CCA TTG TCA TTC CAG 2160 
Ser He Val Asn Arg Val Arg Gin Gly Tyr Ser Pro Leu Ser Phe Gin 
705 710 715 720 

ACC CAC TTC CCA GCT CCG AGG GGA CCC GAC AGG CCA GAC GGA ATC GAA 2208 
Thr His Phe Pro Ala Pro Arg Gly Pro Asp Arg Pro Asp Gly He Glu 
725 730 735 

GGA GAA GGT GGA GAG AGA GAC AGA GAC AGA TCC GTG CGA TTA GTG GAT 2256 
Gly Glu Gly Gly Glu Arg Asp Arg Asp Arg Ser Val Arg Leu Val Asp 
740 745 750 

GGA TTC TTA GCA CTT CTC TGG GAA GAC CTG CGC AAC CTG TGC CTC TTC 2304 
Gly Phe Leu Ala Leu Leu Trp Glu Asp Leu Arg Asn Leu Cys Leu Phe 
755 760 765 

AGC TAC CAC CGC TTG AGA GAC TTA CTC TTG ATT GTA ACG AGG ATT GTG 2352 
Ser Tyr His Arg Leu Arg Asp Leu Leu Leu He Val Thr Arg lie Val 
770 775 780 

GAA CTT CTC GGA CGC AGG GGG TGG GAA GCC CTC AAA TAT TTG TGG AGT 2400 
Glu Leu Leu Gly Arg Arg Gly Trp Glu Ala Leu Lys Tyr Leu Trp Ser 
785 790 795 800 

CTC CTA CAG TAT TGG AGT CAG GAG CTA AAG AAT AGT GCT GTC AAC TTG 2448 
Leu Leu Gin Tyr Trp Ser Gin Glu Leu Lys Asn Ser Ala Val Asn Leu 
80S 810 815 

TTC AAT ACC ACA GCT ATA GTA GTA GCT GAG GGG ACA GAT AGG ATC ATA 2496 
Phe Asn Thr Thr Ala He Val Val Ala Glu Gly Thr Asp Arg He He 
820 825 830 

GAA GTA GTA CAA AGA CTT TGT AGA GCT ATT CTC CAC ATA CCT AGA AGA 2544 
Glu Val Val Gin Arg Leu Cys Arg Ala He Leu His He Pro Arg Arg 
835 840 845 
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ATT AGA CAG GGC TTG GAA AGA TTT TTG CTA TAA 2577 
lie Arg Gin Gly Leu Glu Arg Phe Leu Leu * 
850 855 



(2) INFORMATION FOR SEQ ID NO : 2 : 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 859 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Met Arg Val Lys Glu Asn Cys Gin His Leu Trp Arg Trp Gly Trp Lys 
1 5 10 15 

Trp Gly He Met Leu Leu Gly Met Leu Met He Cys Ser Ala Thr Glu 
20 25 30 

Lys Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Thr 
35 40 45 

Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Glu Glu Glu 
50 55 60 

Val His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn 
65 70 75 80 

Pro Gin Glu He Val Leu Ala Asn Val Thr Glu Asp Phe Asn Met Trp 
85 90 95 

Lys Asn Glu Met Val Glu Gin Met His Thr Asp He He Ser Leu Trp 
100 105 HO 

Asp Glu Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr 
115 120 125 

Leu Asn Cys Thr Asp Leu Lys Asn Glu Thr Lys Thr Asn Ser Ser Asp 
130 135 140 

Ala Asn Ser Asn Ser Gly Glu He Met Gly Asn Glu Glu He Lys Asn 
145 150 155 160 

Cys Ser Phe Asn Val Ser Thr Gly Ala Pro Gly Lys Val Gin Lys Glu 
165 170 175 

Tyr Ser Leu Phe Tyr Ala Leu Asp lie Val Ser He Lys Asn Glu Asn 
180 185 190 

Asn Ser Thr Ser His Met Leu Thr Ser Cys Asn Thr Ser Val Ser Thr 
195 200 205 

Gin Ala Cys Pro Lys Val Ser Phe Glu Pro He Pro He His Tyr Cys 
2io 215 220 
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Ala Pro Ala Gly Phe Ala He Leu Lys Cys Asn Asp Lys Lys Phe Asn 
225' 230 235 240 

Gly Thr Gly Pro Cys Asn Asn Val Ser Thr Val Gin Cys Thr His Gly 
245 250 255 

He Arg Pro Val Val Ser Thr Gin Leu Leu Leu Asn Gly Ser Val Ala 
260 265 270 

Glu Glu Glu Val Val Leu Arg Ser Ala Asn Phe Ser Asp Asn Ala Lys 
275 280 285 

Thr He He Val Gin Leu Asn His Ser Val Glu He Thr Cys Thr Arg 
290 295 300 

Pro Asn Tyr Asn Glu Thr Lys Lys He Arg He His Arg Gly Tyr Gly 
305 310 315 320 

Arg Ser Phe Val Thr Val Arg Lys Leu Gly Asp Arg Lys Gin Ala His 
325 330 335 

Cys Thr Met Asn Arg Thr Lys Trp Asp Asn Ala Leu Lys Gin He Ala 
340 345 350 

Ser Lys Leu Arg Glu Gin Phe Asn Lys Thr Ala He He Phe Asn Arg 
355 360 365 

Ser Ser Gly Gly Asp Leu Glu He Glu Met His Ser Phe Asn Cys Gly 
370 375 380 

Gly Glu Leu Phe Tyr Cys Asn Thr Thr Lys Leu Phe Asn Ser Thr Trp 
385 390 395 400 

Asn Glu Thr Thr Glu Ser Asn Gly Lys Gly Glu Asn He Thr Leu Pro 
405 410 415 

Cys Arg He Arg Gin Phe Val AsiTMet Trp Gin Lys Val Gly Lys Ala 
420 425 430 

Met Tyr Ala Pro Pro Ser Asp Gly Gin He Arg Cys Thr Ser Asn He 
435 440 445 

Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly His Asn Asp Asn Asn Thr 
450 455 460 

Asn Asn Glu Thr Phe Arg Pro Gly Arg Gly Asp Met Arg Asp Asn Trp 
465 470 475 480 



Arg Ser Glu Leu Tyr Lys Tyr Lys Val He Lys He Glu Pro Leu Gly 
485 490 495 

Val Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gin Arg Glu Lys Arg 
500 505 510 

Ala Val Gly Met Val Gly Ala Met Phe Leu Gly Phe Leu Gly Ala Ala 
515 520 525 



66 



SUBSTITUTE SHEET (RULE 26) 



WO 98/59074 PCT/US98/12990 

Gly Ser Thr Met Gly Ala Ala Ser Leu Thr Leu Thr Val Gin Ala Arg 
530 535 540 

Gin Leu Leu Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg Ala 
545 550 555 560 

He Glu Ala Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly He Lys 
565 570 575 

Gin Leu Gin Ala Arg Val Leu Ala Val Glu Arg Tyr Leu Lys Asp Gin 
580 585 590 

Gin Leu Leu Gly He Trp Gly Cys Ser Gly Lys Leu He Cys Thr Thr 
595 600 605 

Thr Val Pro Trp Asn Ala Ser Trp Ser Asn Lys Ser Leu Asp Gin He 
610 615 620 

Trp Asn Asn Met Thr Trp Leu Glu Trp Asp Arg Glu He Ala Asn Tyr 
625 630 635 640 

Thr Asn Leu He His His Leu He Glu Glu Ser Gin Asn Gin Gin Glu 
645 650 655 

Lys Asn Glu Gin Glu Leu Leu Glu Leu Asp Lys Trp Ala Ser Leu Trp 
660 665 670 

Ser Trp Phe Asp He Ser Asn Trp Leu Trp Tyr He Lys He Phe He 
675 680 685 

Met He Val Ala Gly Leu Val Gly Leu Arg He Val Phe Ala Val Leu 
690 695 700 

Ser He Val Asn Arg Val Arg Gin Gly Tyr Ser Pro Leu Ser Phe Gin 
70S 710 715 720 

Thr His Phe Pro Ala Pro Arg Gly Pro Asp Arg Pro Asp Gly He Glu 
725 730 735 

Gly Glu Gly Gly Glu Arg Asp Arg Asp Arg Ser Val Arg Leu Val Asp 
740 745 750 

Gly Phe Leu Ala Leu Leu Trp Glu Asp Leu Arg Asn Leu Cys Leu Phe 
755 760 765 

Ser Tyr His Arg Leu Arg Asp Leu Leu Leu He Val Thr Arg He Val 
770 775 780 

Glu Leu Leu Gly Arg Arg Gly Trp Glu Ala Leu Lys Tyr Leu Trp Ser 
785 790 795 800 

Leu Leu Gin Tyr Trp Ser Gin Glu Leu Lys Asn Ser Ala Val Asn Leu 
80S 810 815 

Phe Asn Thr Thr Ala He Val Val Ala Glu Gly Thr Asp Arg He He 
820 825 830 
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Glu Val Val Gin Arg Leu Cys Arg Ala He Leu His He Pro Arg Arg 
835 840 845 

He Arg Gin Gly Leu Glu Arg Phe Leu Leu * 
850 ' 855 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 54 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

Met Arg Ala Arg Glu Lys Glu Arg Asn Cys Gin Asn Leu Trp Lys Trp 
15 10 15 

Gly He Met Leu Leu Gly Met Leu Met Thr Cys Ser Ala Ala Glu Asp 
20 25 30 

Leu Trp Val Thr Val Tyr Tyr Gly Val Pro He Trp Lys Glu Ala Thr 
35 40 45 

Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Lys Lys Glu Ala 
50 55 60 

His Asn He Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro 
65 70 75 80 

Gin Glu He Glu Leu Glu Asn Val Thr Glu Asn Phe Asn Met Trp Lys 
85 90 95 

Asn Asn Met Val Glu Gin Met His Glu Asp He He Ser Leu Trp Asp 
100 105 HO 

Gin Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu 
115 120 125 

Asn Cys Thr Asp Leu Lys Asn Glu Thr Lys Thr Asn Ser Ser Asp Ala 
130 135 140 

Asn Ser Asn Ser Gly Glu lie Met Gly Asn Glu Glu He Lys Asn Cys 
145 150 155 160 

Ser Phe Asn Val Ser Thr Gly Ala Pro Gly Lys Val Gin Lys Glu Tyr 
165 170 175 

Ser Leu Phe Tyr Ala Leu Asp Asn Asn Asn Arg Thr Asn Ser Thr Asn 
180 185 190 
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Tvr Arg Leu lie Asn Cys Asp Thr Ser Thr He Thr Gin Ala Cys Pro 
y 195 200 205 

Lvs lie Ser Phe Glu Pro lie Pro lie His Phe Cys Ala Pro Ala Gly 
y 210 215 220 



Phe Ala lie Leu Lys Cys Arg Asp Lys Lys Phe Asn Gly Thr Gly Pro 

22S 230 23S 240 

CyS Ser Asn Val Ser Thr Val Gin Cys Thr His Gly He Arg Pro Val 

245 250 255 

Val Ser Thr Gin Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu lie 

260 265 270 

lie He Arg Ser Glu Asn Leu Thr Asn Asn Val Lys Thr He lie Val 

275 280 285 

Ala Ser He Val He Asn Cys Thr Arg Pro Tyr Lys Tyr 



Gin Leu Asn 
290 



295 300 



Thr Arg Gin Arg Thr Ser He Gly Leu Arg Gin Ser Leu Tyr Thr He 
305 310 315 

Thr Gly Lys Lys Lys Lys Thr Gly Tyr He Gly Gin Ala His Cys Lys 
325 330 33S 

He Ser Arg Ala Glu Trp Asn Lys Ala Leu Gin Gin Val Ala Thr Lys 
340 345 350 

Leu Gly Asn Leu Leu Asn Lys Thr Thr He Thr Phe Lys Pro Ser Ser 
355 360 365 



Gly Gly Asp Pro Glu He Thr Ser His Met Leu Asn Cys Gly Gly Asp 
370 375 380 

Phe Phe Tyr Cys Asn Thr Ser Arg Leu Phe Asn Ser Thr Trp Asn Gin 



390 395 



385 

Ser Thr Gly Phe Asn Asn Gly Thr Val Thr Leu Pro Cys Arg 



Thr Asn 

405 



410 «" 



He Lys Gin He Val Asn Leu Trp Gin Arg Val Gly Lys Ala Met Tyr 

420 425 430 

Ala Pro Pro He Glu Gly Leu He Lys Cys Ser Ser Asn He Thr Gly 

435 440 445 

Leu Leu Leu Thr Arg Asp Gly Gly Ala Asn Asn Ser Ser His Glu Thr 
450 455 460 



lie Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu 
465 470 475 

Tyr Lys Tyr Lys Val Val Lys lie Glu Pro lie Gly Val Ala Pro Thr 
485 49° 495 
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Lys Ala Arg Arg Arg Val Val Glu Arg Glu Lys Arg Ala lie Gly Leu 



Val Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met Gly 
515 



Glv Ala Val Phe Leu Gly Phe Leu uiy /ua «xc — 

520 52S 



Ala Ala Ser Val Thr Leu Thr Val Gin Ala Arg Gin Leu Met Ser Gly 
530 535 540 

lie val His Gin Gin Asn Asn Leu Leu Arg Ala lie Glu Ala Gin Gin 
545 550 555 

His Leu Leu Gin Leu Thr Val Trp Gly lie Lys Gin Leu Gin Ala Arg 
565 570 575 

Val Leu Ala Val Glu Arg Tyr Leu Arg Asp Gin Gin Leu Leu Gly He 
580 585 590 

Trp Gly Cys Ser Gly Arg His He Cys Thr Thr Asn Val Pro Trp Asn 
59S 600 6° 5 

Ser Ser Trp Ser Asn Arg Ser Leu Asp Glu He Trp Gin Asn Met Thr 
610 "5 620 

Trp Met Glu Trp Glu Arg Glu He Asp Asn Tyr Thr Gly Leu He Tyr 

625 630 " S 

Ser Leu He Glu Glu Ser Gin He Gin Gin Glu Lys Asn Glu Lys Glu 
645 650 655 



L eu Leu Glu Leu Asp Lys Trp Ala Ser Leu Trp Asn Trp Phe Ser He 
660 665 670 

Thr Lys Trp Leu Trp Tyr He Lys Leu Phe He Met lie Val Gly Gly 



675 



680 685 



Leu He Gly Leu Arg He Val Phe Ala Val Leu Ser Val Val Asn Arg 

690 695 700 

Val Arg Gin Gly Tyr Ser Pro Leu Ser Phe Gin Thr Leu Leu Pro Val 
705 710 715 

Pro Arg Gly Pro Asp Arg Pro Glu Glu lie Glu Glu Glu Gly Gly Glu 
725 730 7 " 

Arg Gly Arg Asp Arg Ser He Arg Leu Val Asn Gly Leu Phe Ala Leu 
740 745 75 ° 

Phe Trp Asp Asp Leu Arg Asn Leu Cys Leu Phe Ser Tyr His Arg Leu 

755 



755 



760 



Arg Asp ser He Leu He Ala Ala Arg He Val Glu Leu Leu Gly Arg 
770 775 780 

Arg Gly Trp Glu Ala Leu Lys Tyr Leu Trp Asn Leu Leu Gin Tyr Trp 
IOC 790 795 
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Ser Gin Glu Leu Arg Asn Ser Ala Ser Ser Leu Leu Asp Thr lie Ala 
805 810 815 

lie Ala val Ala Glu Arg Thr Asp Arg Val He Glu Val Val Gin Arg 
B20 825 830 

Ala Cys Arg Ala lie Leu Asn Val Pro Arg Arg lie Arg Gin Gly Leu 
835 840 845 

Glu Arg Leu Leu Leu Xaa 
850 

(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 862 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 
<iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



Met Arg Val Lys Glu Lys Tyr Gin His Leu Trp Arg Trp Gly Trp Lys 
! 5 10 15 

Trp Gly Thr Met Leu Leu Gly He Leu Met He Cys Ser Ala Thr Glu 



20 



25 30 



Lys Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala 
35 40 *5 

Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu 



50 



55 



60 



Val His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn 



65 



Gin Glu val val Leu Val Asn Val Thr Glu Asn Phe Asn Met Trp 



Pro 

85 



90 95 



Lys Asn Asp Met Val Glu Gin Met His Glu Asp He lie Ser Leu Trp 



100 



105 



Asp 



Gin Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Ser 



115 



120 



125 



Leu Lys Cys Thr Asp Leu Gly Asn Ala Thr Asn Thr Asn Ser Ser Asn 



130 



135 



140 



Thr Asn Ser Ser Ser Gly Glu Met Met Met Glu Lys Gly Glu lie Lys 

145 ^ 



150 155 
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" ash Cys ser Phe Asn lie Ser Thr Ser lie Arg Gly Lys Val Gin Lys 

165 170 

Glu T yr Ala Phe Phe Tyr Lys Leu Asp lie He Pro He Asp Asn Asp 
180 185 



Thr Thr Ser Tyr Thr Leu Thr Ser Cys Asn Thr Ser Val He Thr Gin 
19S 200 205 

A la Cys Pro Lys Val Ser Phe Glu Pro lie Pro lie His Tyr Cys Ala 

210 215 
pro Ala Gly Phe Ala lie Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly 
225 230 23S 

Thr Gly Pro Cys Thr Asn Val Ser Thr Val Gin Cys Thr His Gly lie 

245 250 

Pr o Val Val Ser Thr Gin Leu Leu Leu Asn Gly Ser Leu Ala Glu 

260 265 

G lu Glu Val Val lie Arg Ser Ala Asn Phe Thr Asp Asn Ala Lys Thr 
275 280 285 

Ile lie Val Gin Leu Asn Gin Ser Val Glu lie Asa Cys Thr Arg Pro 



290 



295 



Asn Asn Asn Thr Arg Lys Ser He Arg He Gin Arg Gly Pro Gly Arg 

305 310 315 



Ala Phe Val Thr lie Gly Lys lie Gly Asn Met Arg Gin Ala His Cys 

325 330 

Asn lie Ser Arg Ala Lys Trp Asn Ala Thr Leu Lys Gin II. Ala Ser 

340 345 

Lys Leu Arg Glu Gin Phe Gly Asn Asn Lys Thr He He Phe Lys Gin 

355 3 «° 3 " 

ser Ser Gly Gly Asp Pro Glu He Val Thr His Ser Phe Asn Cys Gly 

370 375 380 

Gly Glu Phe Phe Tyr Cys Asn Ser Thr Gin Leu Phe Asn Ser Thr Trp 
385 390 395 

Phe Asn Ser Thr Trp Ser Thr Glu Gly Ser Asn Asn Thr Glu Gly Ser 
405 410 

Asp Thr lie Thr Leu Pro Cys Arg He Lys Gin Phe He Asn Met Trp 

420 425 

Gin Glu val Gly Lys Ala Met Tyr Ala Pro Pro lie Ser Gly Gin He 

435 "0 445 

Arg cys ser Ser Asn He Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly 



450 



455 
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' Asn Asn Asn Asn Gly Ser Glu He Phe Arg Pro Gly aly Oly Asp Met 
465 "70 "75 



^ Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val vl Lys He 

Glu Pro Leu Gly Val Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gin 

500 505 
^g Glu Lys Arg Ala Val Gly He Gly Ala Leu Phe Leu Gly Phe Leu 

515 520 
Gly Ala Ala Gly Ser Thr Met Gly Ala Arg Ser Met Thr Leu Thr Val 
530 535 

, » m« t«u Leu Ser Gly He Val Gin Gin Gin Asn Asn Leu 
Gin Ala Arg Gin Leu Leu ber ^xy 5g(J 



545 550 



Leu Arg Ala He Glu Ala Gin Gin His Leu Leu Gin Leu Thr Val Trp 
565 570 

Gly He Lys Gin Leu Gin Ala Arg He Leu Ala Val Glu Arg Tyr Leu 

580 585 



Ly s Asp Gin Gin Leu Leu Gly He Trp Gly Cys Ser Gly Lys Leu He 

595 600 
Cys Thr Thr Ala Val Pro Trp Asn Ala Ser Trp Ser Asn Lys Ser Leu 

610 615 
Glu Gin He Trp Asn Asn Met Thr Trp Met Glu Trp Asp Arg Glu He 
625 «° " 5 

Asn Asn Tyr Thr Ser Leu He His Ser Leu He Glu Glu Ser Gin Asn 

645 650 

Gin Gin- Glu Lys Asn Glu Gin Glu Leu Leu Glu Leu Asp Lyj Trp Ala 

660 665 
Ser Leu Trp Asn Trp Phe Asn He Thr Asn Trp Leu Trp Tyr He Lys 



675 



680 



He Phe He Met 
690 



lie Val Gly Gly Leu Val Gly Leu Arg lie Val Phe 



695 



700 



lie Val Asn Arg Val Arg Gin Gly Tyr Ser Pro Leu 



Ala Val Leu Ser He val Asn ^ — — < ?20 



705 



710 



ser Phe Gin Thr His Leu Pro Thr Pro Arg Gly Pro Asp Arg Pro Glu 

725 730 

Gly He Glu Glu Glu Gly Gly Glu Arg Asp Arg Asp Arg Ser He Arg 

740 745 
Leu Val Asn Gly Ser Leu Ala Leu He Trp Asp Asp Leu Arg Ser Leu 



760 765 
755 760 
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Cys Leu Phe Ser Tyr His Arg Leu Arg Asp Leu Leu Leu He Val Thr 



770 



775 



^ Ile val Glu Leu Leu Gly Arg Arg Qly Trp Glu Ala Leu Lys Tyr 
785 790 7 " 

Trp Trp Asn Leu Leu Gin Tyr Trp Ser Gin Glu Leu Lys Asn Ser Ala 
805 810 

Val Ser Leu Leu Asn Ala Thr Ala Ile Ala Val Ala Glu Gly Thr Asp 

820 825 

Ar g val lie Glu val Val Gin Gly Ala Cys Arg Ala lie Arg His He 

840 84b 



835 



Pro Arg Arg He Arg Gin Gly Leu Glu Arg He Leu Leu Xaa 



850 



855 



(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 56 amino acids 
{B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met Lys Val Lys Gly Thr Arg Arg Asn Tyr Gin His Leu Trp Arg Trp 
c 10 ■ L3 



Gly Thr Leu Leu Leu Gly Met Leu Mef lie Cys Ser Ala Thr Glu Lys 

20 25 

L eu Trp val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala Thr 



35 



Thr Thr Leu 
50 



Phe Cys Ala Ser Asp Ala Arg Ala Tyr Asp Thr Glu Val 
55 so 

His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro 

65 ™ " 

Gin Glu val val Leu Gly Asn Val Thr Glu Asn Phe Asn Met Trp Lys 

85 90 
Asn Asn Met Val Glu Gin Met Gin Glu Asp lie He Ser Leu Trp Asp 

i rt c .law 



100 105 



OXa ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu 
115 120 125 
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Asn cys Thr Asp Leu Gly Lys Ala Thr Asn Thr Asn Ser Ser Asn Trp 



130 135 1« 



Ly8 Gl« Qlu He Lys Gly Glu lie Lys Asn Cys Ser Phe Asn lie Thr 
145 150 155 

Thr Ser He Arg Asp Lys lie Gin Lys Glu Asn Ala Leu Phe Arg Asn 

165 1™ 17 



Leu 



Asp Val Val Pro He Asp Asn Ala Ser Thr Thr Thr Asn Tyr Thr 

180 185 

Asn Tyr Arg Leu lie His Cys Asn Arg Ser Val He Thr Gin Ala Cys 
195 200 205 

Pro Lys val Ser Phe Glu Pro He Pro He His Tyr Cys Thr Pro Ala 

■51 c 22 0 

210 21b 

Gly Phe Ala He Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Lys Gly 
225 2 30 " 5 

Pro Cys Thr Asn Val Ser Thr Val Gin Cys Thr His Gly He Arg Pro 

245 250 

Ue Val Ser Thr Gin Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu 

260 265 
val Val He Arg Ser Asp Asn Phe Thr Asn Asn Ala Lys Thr He He 



260 

ser Asd Asn Phe Thr Asn Asn Ala 

285 

val Gin Leu Asn Glu Ser Val Ala He Asn Cys Thr Arg Pro Asn Asn 



275 280 



295 



300 



290 

Asn Thr Arg Lys Ser He Tyr He Gly Pro Gly Arg Ala Phe His Thr 
305 310 315 

Thr Gly Arg He He Gly Asp He Arg Lys Ala His Cys Asn He Ser 

325 330 



Arg Ala Gin Trp Asn Asn Thr Leu Glu Gin He Val Lys Lys Leu Arg 

340 345 

Glu Gin Phe Gly Asn Asn Lys Thr He Val Phe Asn Gin Ser Ser Gly 
355 36° 365 

Gly Asp Pro Glu He Val Met His Ser Phe Asn Cys Arg Gly Glu Phe 
370 3'5 380 

Phe Tyr Cys Asn Thr Thr Gin Leu Phe Asn Asn Thr Trp Arg Leu Asn 
385 390 395 

His Thr Glu Gly Thr Lys Gly Asn Asp Thr He He Leu Pro Cys Arg 

405 410 

lie Lys Gin He He Asn Met Trp Gin Glu val Gly Lys Ala Met Tyr 



420 



425 



75 



SUBSTITUTE SHEET (RULE 26) 



PCT/US98/12990 

.WO 98/59074 

Ala Pro Pro lie Gly Gly Gin He Ser Cys Ser Ser Asn He Thr Gly 
435 440 445 



Leu Leu 
450 



Leu Thr Arg Asp Gly Gly Thr Asn Val Thr Asn Asp Thr Glu 
455 460 

val Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu 
465 470 

Leu Tyr Lys Tyr Lys Val He Lys He Glu Pro Leu Gly He Ala Pro 
485 490 495 

Thr Lys Ala Lys Arg Arg Val Val Gin Arg Glu Lys Arg Ala Val Gly 
500 S05 510 

Val Gly Ala Met Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr 



He 

515 



520 525 



Met Gly Ala Val Ser Leu Thr Leu Thr Val Gin Ala Arg Gin Leu Leu 
530 S35 540 

Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg Ala He Glu Ala 

- «- <- 56 0 

Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly He Lys Gin Leu Gin 
565 570 575 

Ala Arg Val Leu Ala Val Glu Arg Tyr Leu Arg Asp Gin Gin Leu Leu 



Ser 

545 



550 555 



5S0 



585 590 



Gly He Trp Gly Cys Ser Gly Lys Leu He Cys Thr Thr Ala Val Pro 
595 600 605 



610 

Met 
625 



Trp Asn Ala Ser Trp Ser Asn Lys Ser Leu Glu Asp lie Trp Asp Asn 
610 6" 620 

Thr Trp Met Gin Trp Glu Arg Glu He Asp Asn Tyr Thr Asn Thr 

cm o40 



630 635 



He Tyr Thr Leu Leu Glu Glu Ser Gin Asn Gin Gin Glu Lys Asn Glu 
645 650 655 

Gin Glu Leu Leu Glu Leu Asp Lys Trp Ala Ser Leu Trp Asn Trp Phe 
660 665 670 

ser He Thr Asn Trp Leu Trp Tyr lie Lys He Phe He Met He Val 
675 680 685 

Gly Gly Leu Val Gly Leu Arg He Val Phe Ala Val Leu Ser He Val 
690 695 700 

Arg Val Arg Gin Gly Tyr Ser Pro Leu Ser Phe Gin Thr Arg Leu 



Asn 
705 



Pro 



710 



715 



val Pro Arg Gly Pro Asp Arg Pro Asp Gly He Glu Glu Glu Gly 



725 730 



735 
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Gly Glu Arg Asp Arg Asp Arg Ser Val Arg Leu Val Asp Gly Phe Leu 
740 745 750 

Ala Leu He Trp Glu Asp Leu Arg Ser Leu Cys Leu Phe Ser Tyr Arg 
755 760 765 

Arg Leu Arg Asp Leu Leu Leu lie Ala Ala Arg Thr Val Glu lie Leu 
770 775 780 

Gly His Arg Gly Trp Glu Ala Leu Lys Tyr Trp Trp Ser Leu Leu Gin 
785 790 795 800 

Tyr Trp He Gin Glu Leu Lys Asn Ser Ala Val Ser Trp Leu Asn Ala 
80S 810 815 

Thr Ala He Ala Val Thr Glu Gly Thr Asp Arg Val He Glu Val Ala 
820 825 830 

Gin Arg Ala Tyr Arg Ala He Leu His He His Arg Arg He Arg Gin 
835 840 845 

Gly Leu Glu Arg Leu Leu Leu Xaa 
850 855 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 854 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 

(ix) FEATURE: 

(A) NAME/KEY: Region 

(B) LOCATION: 1..854 

(D) OTHER INFORMATION: /note- "Xaa indicates residues 
which are not specified in the consensus sequence." 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: 

Met Arg Xaa Xaa Glu Asn Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Trp Lys 
1 5 10 15 

Trp Gly He Met Leu Leu Gly Met Leu Met Xaa Ser Ala Xaa Glu Xaa 
20 25 30 

Leu Trp val Thr Val Tyr Tyr Gly Val Pro Xaa Lys Glu Thr Thr Thr 
35 40 45 

Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Xaa Xaa Glu Xaa His Asn 
50 55 60 
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Xaa Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro Gin Glu 
6S 70 75 80 

He Xaa Leu Xaa Asn Val Thr Glu Xaa Phe Asn Met Trp Lys Asn Xaa 
85 90 95 

Met Val Glu Gin Met His Xaa Asp He He Ser Leu Trp Asp Xaa Ser 
100 105 HO 

Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu Asn Cys 
115 120 125 

Thr Asp Leu Lys Asn Glu Thr Xaa Xaa Asn Ser Ser Xaa Xaa Asn Xaa 
130 135 140 

Xaa Xaa Xaa Glu He Met Gly Xaa Glu Xaa Xaa Lys Asn Cys Ser Phe 
14 5 150 155 160 

Asn Val Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Tyr Xaa Leu 
165 170 175 

Phe Xaa Xaa Xaa Xaa He Val Xaa Xaa Xaa Asn Xaa Xaa Asn Ser Thr 
180 185 190 

Xaa Xaa Xaa Leu Xaa Xaa Cys Xaa Thr Ser Xaa Xaa Thr Gin Ala Cys 
195 200 205 

Pro Lys Xaa Ser Phe Glu Pro He Pro He His Xaa Cys Ala Pro Ala 
210 215 220 

Gly Phe Ala He Leu Lys Cys Xaa Asp Lys Lys Phe Asn Gly Thr Gly 
225 230 235 240 

Pro Cys Xaa Asn Val Ser Thr Val Gin Cys Thr His Gly He Arg Pro 
245 250 255 

Val Val Ser Thr Gin Leu Leu Leu Asn Gly Ser Xaa Ala Glu Glu Glu 
260 265 270 

Xaa Xaa Xaa Arg Ser Xaa Asn Xaa Xaa Xaa Asn Xaa Lys Thr He He 
275 280 285 

Val Gin Leu Asn Xaa Ser Xaa Xaa He Xaa Cys Thr Arg Pro Xaa Xaa 
290 295 300 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa He Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
305 310 315 320 

Xaa Thr Gly Lys Xaa Xaa Lys Xaa Gly Xaa Xaa Xaa Gin Ala His Cys 
325 330 335 

Xaa Xaa Xaa Arg Xaa Xaa Trp Xaa Xaa Ala Leu Xaa Gin Xaa Ala Xaa 
340 345 350 

Lys Leu Xaa Xaa Xaa Xaa Asn Lys Thr Xaa lie Xaa Phe Xaa Xaa Ser 
355 360 365 
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Ser Gly Gly Asp Xaa Glu He Xaa Xaa His Xaa Xaa Asn Cys Gly Gly 
370 375 380 

Xaa Xaa Phe Tyr Cys Asn Thr Xaa Xaa Leu Phe Asn Ser Thr Trp Asn 
385 390 395 400 

Xaa Thr Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Thr Leu Pro Cys 
405 410 415 

Arg He Xaa Gin Xaa Val Asn Xaa Trp Gin Xaa Val Gly Lys Ala Met 
420 425 430 

Tyr Ala Pro Pro Xaa Xaa Gly Xaa He Xaa Cys Ser Asn He Thr Gly 
435 440 445 

Leu Leu Leu Thr Arg Asp Gly Gly His Asn Xaa Asn Asn Xaa Xaa Xaa 
450 455 460 

Glu Thr Xaa Arg Pro Gly Xaa Gly Asp Met Arg Asp Asn Trp Arg Ser 
465 470 475 480 

Glu Leu Tyr Lys Tyr Lys Val Xaa Lys He Glu Pro Xaa Gly Val Ala 
485 490 495 

Pro Thr Lys Ala Xaa Arg Arg Val Val Xaa Arg Glu Lys Arg Ala Xaa 
500 505 510 

Met Gly Ala Xaa Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met 
515 520 525 

Gly Ala Ala Ser Xaa Leu Thr Val Gin Ala Arg Gin Leu Xaa Ser Gly 
530 535 540 

He Val Xaa Gin Gin Asn Asn Leu Leu Arg Ala He Glu Ala Gin Gin 
545 550 555 560 

His Leu Leu Gin Leu Thr Val Trp Gly lie Lys Gin Leu Gin Ala Arg 
565 570 575 

Val Leu Ala Val Glu Arg Tyr Leu Xaa Asp Gin Gin Leu Leu Gly He 
580 585 590 

Trp Gly Cys Ser Gly Xaa Xaa He Cys Thr Thr Xaa Val Pro Trp Asn 
595 600 605 

Xaa Ser Trp Ser Asn Xaa Ser Leu Asp Xaa He Trp Xaa Asn Met Thr 
610 615 620 

Trp Xaa Glu Trp Xaa Arg Glu He Xaa Asn Tyr Thr Xaa Leu He Xaa 
625 630 635 640 

Xaa Leu He Glu Glu Ser Gin Xaa Gin Gin Glu Lys Asn Glu Xaa Glu 
645 650 655 

Leu Leu Glu Leu Asp Lys Trp Ala Ser Leu Trp Xaa Trp Phe Xaa He 
660 665 670 
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Xaa Xaa Trp Leu Trp Tyr He Lys Xaa Phe He Met He Val Xaa Gly 
675 680 685 

Leu Xaa Gly Leu Arg He Val Phe Ala Val Leu Ser Xaa Val Asn Arg 
690 695 700 

Val Arg Gin Gly Tyr Ser Pro Leu Ser Phe Gin Thr Xaa Xaa Pro Xaa 
705 710 715 720 

Pro Arg Gly Pro Asp Arg Pro Xaa Xaa He Glu Xaa Glu Gly Gly Glu 
725 730 735 

Arg Xaa Arg Asp Arg Ser Xaa Arg Leu Val Xaa Gly Xaa Xaa Ala Leu 
740 745 750 

Xaa Trp Xaa Asp Leu Arg Asn Leu Cys Leu Phe Ser Tyr His Arg Leu 
755 760 765 

Arg Asp Xaa Xaa Leu He Xaa Xaa Arg He Val Glu Leu Leu Gly Arg 
770 775 780 

Arg Gly Trp Glu Ala Leu Lys Tyr Leu Trp Xaa Leu Leu Gin Tyr Trp 
785 790 795 800 

Ser Gin Glu Leu Xaa Asn Ser Ala Xaa Xaa Leu Xaa Xaa Thr Xaa Ala 
805 810 815 

He Xaa Val Ala Glu Xaa Thr Asp Arg Xaa He Glu Val Val Gin Arg 
820 825 830 

Xaa Cys Arg Ala He Leu Xaa Xaa Pro Arg Arg He Arg Gin Gly Leu 
835 840 845 



Glu Arg Xaa Leu Leu Xaa 
850 



(2) INFORMATION FOR SEQ ID NO: 7: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 866 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 



(ii) MOLECULE TYPE: protein 



(iii) HYPOTHETICAL: NO 



(ix) FEATURE: 

(A) NAME /KEY : Region 

(B) LOCATION: 1. .866 

(D) OTHER INFORMATION: /note= "Xaa residues are not 
specified in this consensus sequence." 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
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Met Arg Val Lys Glu Xaa Xaa Gin His Leu Trp Arg Trp Gly Trp Lys 

• I 5 10 15 

Trp Gly Xaa Met Leu Leu Gly Xaa Leu Met He Cys Ser Ala Thr Glu 
20 25 30 

Lys Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Xaa 
35 40 45 

Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Xaa Xaa Glu 
50 55 60 

Val His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn 
65 70 75 80 

Pro Gin Glu Xaa Val Leu Xaa Asn Val Thr Glu Xaa Phe Asn Met Trp 
85 90 95 

Lys Asn Xaa Met Val Glu Gin Met His Xaa Asp He He Ser Leu Trp 
100 105 HO 

Asp Xaa Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Xaa 
115 120 125 

Leu Xaa Cys Thr Asp Leu Xaa Asn Xaa Thr Xaa Thr Asn Ser Ser Xaa 
130 135 140 

Xaa Asn Ser Xaa Ser Gly Glu Xaa Met Met Xaa Xaa Xaa Glu He Lys 
145 150 155 160 

Asn Cys Ser Phe Asn Xaa Ser Thr Xaa Xaa Xaa Gly Lys Val Gin Lys 
165 170 175 

Glu Tyr Xaa Xaa Phe Tyr Xaa Leu Asp He Val Ser lie Xaa Xaa Xaa 
180 185 190 

Asn Xaa Xaa Thr Ser Xaa Xaa Leu Thr Ser Cys Asn Thr Ser Val Xaa 
195 200 205 

Thr Gin Ala Cys Pro Lys Val Ser Phe Glu Pro He Pro He His Tyr 
210 215 220 

Cys Ala Pro Ala Gly Phe Ala He Leu Lys Cys Asn Xaa Lys Xaa Phe 
225 230 235 240 

Asn Gly Thr Gly Pro Cys Xaa Asn Val Ser Thr Val Gin Cys Thr His 
245 250 255 

Gly He Arg Pro Val Val Ser Thr Gin Leu Leu Leu Asn Gly Ser Xaa 
260 265 270 

Ala Glu Glu Glu Val Val Xaa Arg Ser Ala Asn Phe Xaa Asp Asn Ala 
275 280 285 

Lys Thr He He Val Gin Leu Asn Xaa Ser Val Glu He Xaa Cys Thr 
290 295 300 
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Arg Pro Asn Xaa Asn Xaa Xaa Lys Xaa He Arg He Xaa Arg Gly Xaa 
305 310 315 320 

Gly Arg Xaa Phe Val Thr Xaa Xaa Lys Xaa Gly Xaa Xaa Xaa Gin Ala 
325 330 335 

His Cys Xaa Xaa Xaa Arg Xaa Lys Trp Xaa Xaa Xaa Leu Lys Gin He 
340 345 350 

Ala Ser Lys Leu Arg Glu Gin Phe Gly Asn Xaa Xaa Xaa He He Phe 
355 360 365 

Xaa Xaa Ser Ser Gly Gly Asp Xaa Glu He Xaa Xaa His Ser Phe Asn 
370 375 380 

Cys Gly Gly Glu Xaa Phe Tyr Cys Asn Xaa Thr Xaa Leu Phe Asn Ser 
385 390 395 400 

Thr Trp Phe Asn Xaa Thr Trp Ser Thr Glu Gly Ser Asn Asn Xaa Xaa 
405 410 415 

Gly Ser Xaa Xaa He Thr Leu Pro Cys Arg He Xaa Gin Phe Xaa Asn 
420 425 430 

Met Trp Gin Xaa Val Gly Lys Ala Met Tyr Ala Pro Pro Xaa Xaa Gly 
435 440 445 

Gin He Arg Cys Xaa Ser Asn He Thr Gly Leu Leu Leu Thr Arg Asp 
450 455 460 

Gly Gly His Asn Asp Asn Asn Xaa Xaa Xaa Glu Xaa Phe Arg Pro Gly 
465 470 475 480 

Xaa Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys 
485 490 495 



Val Xaa Lys He Glu Pro Leu Gly Val Ala Pro Thr Lys Ala Lys Arg 
500 505 510 

Arg Val Val Gin Arg Glu Lys Arg Ala Val Gly Met Xaa Gly Ala Xaa 
515 520 525 

Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met Gly Ala Xaa Ser 
530 535 540 

Xaa Thr Leu Thr Val Gin Ala Arg Gin Leu Leu Ser Gly He Val Gin 
545 550 555 560 

Gin Gin Asn Asn Leu Leu Arg Ala He Glu Ala Gin Gin His Leu Leu 
565 570 575 

Gin Leu Thr Val Trp Gly He Lys Gin Leu Gin Ala Arg Xaa Leu Ala 
580 585 590 

Val Glu Arg Tyr Leu Lys Asp Gin Gin Leu Leu Gly He Trp Gly Cys 
595 600 605 
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Ser Gly Lys Leu He Cys Thr Thr Xaa Val Pro Trp Asn Ala Ser Trp 
610 615 620 

Ser Asn Lys Ser Leu Xaa Gin He Trp Asn Asn Met Thr Trp Xaa Glu 
625 630 635 640 

Trp Asp Arg Glu He Xaa Asn Thr Xaa Leu He His Xaa Leu He Glu 
645 650 655 

Glu Ser Gin Asn Gin Gin Glu Lys Asn Glu Gin Glu Leu Leu Glu Leu 
660 665 670 

Asp Lys Trp Ala Ser Leu Trp Xaa Trp Phe Xaa He Xaa Asn Trp Leu 
675 680 685 

Trp Tyr He Lys He Phe He Met He Val Xaa Gly Leu Val Gly Leu 
690 * 695 700 

Arg He Val Phe Ala Val Leu Ser He Val Asn Arg Val Arg Gin Gly 
705 710 715 720 

Tyr Ser Pro Leu Ser Phe Gin Thr His Xaa Pro Xaa Pro Arg Gly Pro 
725 730 735 

Asp Arg Pro Xaa Gly He Glu Xaa Glu Gly Gly Glu Arg Asp Arg Asp 
740 745 750 

Arg Ser Xaa Arg Leu Val Xaa Gly Xaa Leu Ala Leu Xaa Trp Xaa Asp 
755 760 765 

Leu Arg Xaa Leu Cys Leu Phe Ser Tyr His Arg Leu Arg Asp Leu Leu 
770 775 780 

Leu He Val Thr Arg He Val Glu Leu Leu Gly Arg Arg Gly Trp Glu 
785 790 795 800 

Ala Leu Lys Tyr Xaa Trp Xaa Leu Leu Gin Tyr Trp Ser Gin Glu Leu 
805 810 815 

Lys Asn Ser Ala Val Xaa Leu Xaa Asn Xaa Thr Ala He Xaa Val Ala 
820 825 830 

Glu Gly Thr Asp Arg Xaa He Glu Val Val Gin Xaa Xaa Cys Arg Ala 
835 840 845 

He Xaa His He Pro Arg Arg He Arg Gin Gly Leu Glu Arg Xaa Leu 
850 855 860 

Leu Xaa 
865 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 836 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 
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(D) TOPOLOGY: not relevant 
(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 

(ix) FEATURE: 

(A) NAME /KEY : Region 

(B) LOCATION: 1. .836 

(D) OTHER INFORMATION: /note- "Xaa residues are not 
specified in this consensus sequence." 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met Val Lys Glu Trp Trp Gly Leu Leu Gly Met Leu Met He Cys Ser 
15 10 15 

Ala Thr Glu Lys Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp 
20 25 30 

Lys Glu Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Ala Tyr Glu Val 
35 40 45 

His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro 
50 55 60 

Gin Glu val Leu Asn Val Thr Glu Phe Asn Met Trp Lys Asn Met Val 
65 70 75 80 

Glu Gin Met Asp He He Ser Leu Trp Asp Ser Leu Lys Pro Cys Val 
85 90 95 

Lys Leu Thr Pro Leu Cys Val Thr Leu Asn Cys Thr Asp Leu Xaa Xaa 
100 105 HO 

Xaa Thr Xaa Thr Asn Ser Ser Asp Ala Asn Xaa Xaa Xaa Xaa Xaa Xaa 
115 120 125 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
130 135 140 

Xaa Xaa Xaa Lys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
145 150 155 160 

He Xaa Xaa Xaa Xaa Xaa Xaa Xaa Asn Xaa Thr Xaa Xaa Xaa Leu Xaa 
165 170 175 

Xaa Cys Asn Xaa Ser Val Xaa Thr Gin Ala Cys Pro Lys Val Ser Phe 
180 185 190 

Glu Pro He Pro He His Tyr Cys Xaa Pro Ala Gly Phe Ala He Leu 
195 200 205 

Lys Cys Asn Xaa Lys Xaa Phe Asn Gly Xaa Gly Pro Cys Xaa Asn Val 
210 215 220 
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Ser Thr Val Gin Cys Thr His Gly lie Arg Pro Xaa Val Ser Thr Gin 
225 230 235 240 

Leu Leu Leu Asn Gly Ser Xaa Ala Glu Glu Glu Val Val Xaa Arg Ser 
245 250 255 

Xaa Asn Phe Xaa Xaa Asn Ala Lys Thr He He Val Gin Leu Asn Xaa 
260 265 270 

Ser Val Xaa He Xaa Cys Thr Arg Pro Asn Xaa Asn Xaa Xaa Lys Xaa 
275 280 285 

He Xaa He Xaa Xaa Gly Tyr Xaa Xaa Xaa Xaa Xaa Thr Xaa Arg Xaa 
290 295 300 

Xaa Gly Asp Xaa Xaa Xaa Ala His Cys Xaa Xaa Xaa Arg Xaa Xaa Trp 
305 310 315 320 

Xaa Asn Xaa Leu Xaa Gin lie Xaa Xaa Lys Leu Arg Glu Gin Phe Gly 
325 330 335 

Asn Xaa Xaa Xaa He Xaa Phe Asn Xaa Ser Ser Gly Gly Asp Xaa Glu 
340 345 350 

He Xaa Met His Ser Phe Asn Cys Xaa Gly Glu Xaa Phe Tyr Cys Asn 
355 360 365 

Thr Thr Xaa Leu Phe Asn Xaa Thr Trp Arg Xaa Xaa Xaa Thr Glu Xaa 
370 375 380 

Xaa Xaa Xaa Xaa Xaa Xaa He Xaa Leu Pro Cys Arg He Xaa Gin Xaa 
385 390 395 400 

Xaa Asn Met Trp Gin Xaa Val Gly Lys Ala Met Tyr Ala Pro Pro Xaa 
405 410 415 

Xaa Gly Gin He Xaa Cys Xaa Ser Asn He Thr Gly Leu Leu Leu Thr 
420 425 430 

Arg Asp Gly Gly Xaa Asn Xaa Xaa Asn Thr Xaa Xaa Glu Xaa Phe Arg 
435 440 445 

Pro Gly Xaa Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys 
450 455 460 

Tvr Lys Val He Lys He Glu Pro Leu Gly Xaa Ala Pro Thr Lys Ala 
465 470 475 480 

Lys Arg Arg Val Val Gin Arg Glu Lys Arg Ala Val Gly Xaa Val Gly 
485 490 495 

Ala Met Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met Gly Ala 
500 505 510 

Xaa Leu Thr Leu Thr Val Gin Ala Arg Gin Leu Leu Ser Gly He Val 
515 520 525 
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Gin Gin Gin Asn Asn Leu Leu Arg Ala lie Glu Ala Gin Gin His Leu 
530 535 540 

Leu Gin Leu Thr Val Trp Gly He Lys Gin Leu Gin Ala Arg Val Leu 
545 550 555 560 

Ala Val Glu Arg Tyr Leu Xaa Asp Gin Gin Leu Leu Gly He Trp Gly 
565 570 575 

Cys Ser Gly Lys Leu He Cys Thr Thr Xaa Val Pro Trp Asn Ala Ser 
580 585 590 

Trp Ser Asn Lys Ser Leu Xaa Xaa He Trp Xaa Aan Met Thr Trp Xaa 
595 600 60S 

Xaa Trp Xaa Arg Glu He Xaa Asn Tyr Thr Asn Xaa He Xaa Xaa Leu 
610 615 620 

Xaa Glu Glu Ser Gin Asn Gin Gin Glu Lys Asn Glu Gin Glu Leu Leu 
625 630 635 640 

Glu Leu Asp Lys Trp Ala Ser Leu Trp Xaa Trp Phe Xaa He Xaa Asn 
645 650 655 

Trp Leu Trp Tyr He Lys He Phe He Met He Val Xaa Gly Leu Val 
660 665 670 

Gly Leu Arg He Val Phe Ala Val Leu Ser He Val Asn Arg Val Arg 
675 680 685 

Gin Gly Tyr Ser Pro Leu Ser Phe Gin Thr Xaa Xaa Pro Xaa Pro Arg 
690 695 700 

Gly Pro Asp Arg Pro Asp Gly He Glu Xaa Glu Gly Gly Glu Arg Asp 
705 710 715 720 

Arg Asp Arg Ser Val Arg Leu Val Asp Gly Phe Leu Ala Leu Xaa Trp 
725 730 735 

Glu Asp Leu Arg Xaa Leu Cys Leu Phe Ser Tyr Xaa Arg Leu Arg Asp 
740 745 750 

Leu Leu Leu He Xaa Xaa Arg Xaa Val Glu Xaa Leu Gly Xaa Arg Gly 
755 760 765 

Trp Glu Ala Leu Lys Tyr Xaa Trp Ser Leu Leu Gin Tyr Trp Xaa Gin 
770 775 780 

Glu Leu Lys Asn Ser Ala Val Xaa Xaa Xaa Asn Xaa Thr Ala He Xaa 
785 790 795 800 

Val Xaa Glu Gly Thr Asp Arg Xaa He Glu Val Xaa Gin Arg Xaa Xaa 
805 810 815 

Arg Ala He Leu His He Xaa Arg Arg He Arg Gin Gly Leu Glu Arg 
820 825 830 
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Xaa Leu Leu Xaa 
835 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Oligonucleotide." 

(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
CCCTTCGAAG AGGATATAAT CAGTTTATGG GATCAAAGC 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Oligonucleotide." 

(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

CCCTTCGAAC TCTTCTTCTG CTAGACTGCC ATT 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 9193 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
GGTCTCTCTG GTTAGACCAG ATCTGAGCCT GGGAGCTCTC TGGCTAGCTA GGGAACCCAC 
TGCTTAAGCC TCAATAAAGC TTGCCTTGAG TGCTTCAAGT AGTGTGTGCC CGTCTGTTGT 120 
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GTGACTCTGG TAACTAGAGA TCCCTCAGAC CCTTTTAGTC AGTGTGGAAA AATCTCTAGC 180 

AGTGGCGCCC GAACAGGGAC CGGAAAGCAA AAGGGAAACC AGAGAAGCTC TCTCGACGCA 240 

GGACTCGGCT TGCTGAAGCG CGCACAGCAA GAGGCGAGGG GCGGCGACTG GTGAGTACGC 3 00 

CGATATTTTT GACTAGCGGA GGCTAGAAGG AGAGAGATGG GTGCGAGAGC GTCGGTATTA 360 

AGCGGGGGAG ATTTAGATCG ATGGGAAAAA ATTCGGTTAA GGCCAGGGGG AAAGAAAAAA 420 

TATATGTTAA AACATATAGT ATGGGCAAGC AGGGAGCTAG AACGATTCGC AGTCAATCCT 480 

GGCCTGTTAG AAACATCAGA AGGCTGTAGA CAAGTACTGG AACAGCTACA GCCATCCCTT 540 

CAGACAGGAT CAGAAGAACT TAGATCCTTA TATAATACAA TAGCAACCCT CTATGGTGTG 600 

CATCAAAAAA TAGAGGTAAA AGACACCAAG GAAGCTTTAG ACAAAATAGA GGAAGAGCAA 660 

AACAAAAGTA AGAAAAAAGC ACAGCAAGCA GCAGCTGACA CAGGACACAG CAGCCAGGTC 720 

AGCCAAAATT ACCCTATAGT GCAGAACCTA CAGGGGCAAA TGGTACATCA GGCCATATCA 780 

CCTAGAACTT TAAATGCATG GGTAAAAGTA ATAGAAGAGA AGGCTTTCAG CCCAGAAGTG 840 

ATACCCATGT TTTCAGCATT ATCAGAAGGA GCCACCCCAC AAGATTTAAA CACCATGTTA 900 

AACACAGTGG GGGGACATCA AGCAGCCATG CAAATGTTAA AAGAGACCAT CAATGAGGAA 960 

GCTGCAGAAT GGGATAGAGT GCATCCAGTG CAGGCAGGGC TTATTGCACC AGGCCAGATG 1020 

AGAGAACCAA GGGGAAGTGA CATAGCAGGA ACTACTAGTA CCCTTCAGGA ACAAATAGGA 1080 

TGGATGACAA GTAATCCACC TATCCCAGTA GGAGAAATTT ATAAAAGGTG GATAATCCTG 1140 

GGCTTAAATA AAATAGTAAG AATGTATAGC CCTATCAGCA TTCTAGACAT AAGACAAGGA 1200 

CCAAAAGAAC CCTTTAGAGA CTATGTAGAC CGGTTCTATA AAACTCTAAG AGCCGAGCAA 1260 

GCTTCACAGG AAGTAAAAAA TTGGATGACA GAAACCTTGC TGGTC CAAAA TGCGAACCCA 1320 

GATTGTAAGA CTATCTTAAA AGCATTAGGA CCAGGAGCTA CACTAGAAGA AATGATGACA 1380 

GCATGTCAGG GAGTGGGAGG ACCCGGCCAT AAGGCAAGAG TTTTAGCTGA AGCAATGAGC 1440 

CAAGTAACAA ATTCAGCTGC CATAATGATG CAGAGAGGCA ATTTTAAGAA CCAAAGAAAG 1500 

ATGGTTAAGT GTTTCAATTG TGGCAAAGAG GGGCACGTAG CCAGAAATTG CAGGGCCCCT 1560 

AGAAAAAAGG GCTGTTGGAA ATGTGGAAAG GAAGGACACC AAATGAAAGA TTGTATTGAG 1620 

AGCCAGGCTA ATTTTTTAGG GAAAATCTGG CCTTCCCACA AGGGAAGGCC AGGGAATTTC 1680 

CTTCAGAGCA GACCAGAGCC AACAGCCCCA CCAGAAGAGA GCCTCAGGTC TGGGATAGAG 1740 

ACAACAACTC CCTCTCAGAA GCAGGAGCCA ATAGACAAGG AAGTGTATCC TTTAACTTCC 1800 

CTCAGATCAC TCTTTGGCAA CGACCCCTCG TCACAATAAA GATAGGGGGG CAACTAAAGG i860 
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AAGCTCTATT AGATACAGGA GCAGATGATA CAGTATTAGA AGAAATGAAT TTGCCAGGAA 1920 

GATGGAAGCC AAAAATGATA GGGGGCATTG GAGGTTTTAT CAAAGTAAGA CAGTATGATC 1980 

AGATACCCAT AGAAATCTGT GGACATAAAG CTATAGGTAC AGTATTAGTA GGACCTACAC 2040 

CTGTCAACAT AATTGGAAGA AATCTGTTGA CTCAGATTGG TTGCACTTTA AATTTTCCCA 2100 

TTAGTCCTAT TGAAACTGTA CCAGTGAGAT TAAAGCCAGG AATGGATGGC CCAAAAGTTA 2160 

AACAATGGCC ATTGACAGAA GAAAAAATAA AAGCATTAGT AGAAATTTGT ACAGAAATGG 2220 

AAAAGGAAGG GAAAATTTCA AAAATTGGGC CTGAAAATCC ATACAATACT CCAGTATTTG 2280 

CCATAAAGAA AAAAGACAGT ACTAAATGGA GAAAATTAGT AGATTTCAGA GAACTTAATA 2340 

AGAGAACTCA AGACTTCTGG GAAGTTCAAT TAGGAATACC ACATCCCTCA GGTTTAAAAA 2400 

AGAAAAAATC AGTAACAGTA CTAGATGTGG GTGATGCATA TTTTTCAGTT CCCTTAGATG 2460 

AAGATTTCAG GAAGTATACT ACATTTACCA TACCTAGTAT AAACAATCAG ACACCAGGGA 2520 

TTAGATATCA GTACAATGTG CTTCCACAGG GATGGAAAGG ATCACCAGCA ATATTCCAAA 2580 

GTAGCATGAC AAAAATCTTA GAGCCTTTTA GAAAACAGAA TCCAGACATA GTTATCTATC 2640 

AATACATGGA TGATTTGTAT GTAGGATCTG ACTTAGAAAT AGAGCAGCAT AGAACAAAAA 2700 

TAGAGGAACT GAGACAGCAT CTGTTGAGGT GGGGATTTAC CACACCAGAC AAAAAACATC 2 760 

AGAAAGAACC TCCATTCCTC TGGATGGGGT ATGAACTCCA TCCTGATAAA TGGACAGTAC 2820 

AGCCTATAGT GCTGCCAGAA AAAGACAGCT GGACTGTCAA TGACATACAG AAGTTAGTGG 2880 

GAAAATTGAA TTGGGCAAGT CAGATTTACC CAGGGATTAA AGTAAGGCAA TTATGTAAAC 2940 

TCCTTAGAGG AGCCAAAGCA CTAACAGAAG TAATACCACT AACAGAAGGA GCAGAGCTAG 3000 

AACTGGCAGA AAACAGAGAG ATTCTAAAAG AACCAGTACA TGGAGTGTAT TATGACCCAT 3 060 

CAAAAGACTT AATAGCAGAA ATACAGAAGC AGGGGCAAGG CCAATGGACA TATCAAATTT 3120 

ATCAAGAGCC ATTTAAAAAT CTGAAAACAG GAAAATATGC AAGAACGAGG GGTGCCCACA 3180 

CTAATGATGT AAAACAATTA ACAGAGGCAG TGCAAAAAAT AGCCACAGAA AGCATAGTAA 3240 

TATGGGGGAA GACTCCTAAA TTTAAACTGC CCATACAAAA GGAAACATGG GAAACATGGT 3300 

GGACAGAGTA TTGGCAAGCC ACTTGGATTC CTGAATGGGA GTTTGTCAAT ACCCCTCCTT 3360 

TAGTAAAATT ATGGTACCAG TTAGAGAAAG AACCCATAGT AGGAGCAGAA ACTTTCTATG 3420 

TAGATGGGGC AGCTAACAGG GAGACTAAAT TAGGAAAAGC AGGATATGTT ACTAACAGAG 3480 

GAAGACAAAA GGTTGTCACC CTGACTAACA CAACAAATCA GAAGACTGAG TTACAAGCAA 3540 

TTCATCTAGC TTTACAGGAT TCAGGATCAG AAGTAAACAT AGTAACAGAC TCACAATATG 3600 
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CATTAGGAAT CATTCAAGCA CAACCAGATC AAAGTGAATC AGAGTTAGTC AATCAAATAA 3660 
TAGAGCAGTT AATAAAAAAG GAAAAGGTCT ATCTGGCATG GGTACCAGCA CACAAAGGAA 
TTGGAGGAAA TGAACAAGTA GATAAATTAG TCAGTGCTGG AATCAGAAAA GTACTATTTT 
TAGATGGAAT AGATAAGGCC CAAGAAGAAC ATGAGAAATA TCACAATAAT TGGAGAGCAA 
TGGCTAGTGA TTTTAACCTG CCACCTGTAG TAGCAAAAGA AATAGTAGCC AGCTGTGATA 
AATGTCAGCT AAAAGGAGAA GCTATGCATG GACAAGTAGA CTGTAGTCCA GGAATATGGC 
AACTAGATTG TACACATTTA GAAGGAAAAG TTATCCTGGT AGCAGTTCAT GTAGCCAGTG 
GATATATAGA AGCAGAAGTT ATTCCAGCAG AAACAGGGCA GGAAACAGCA TACTTTCTCT 
TAAAATTAGC AGGAAGATGG CCAGTAAAAA CAATACATAC AGACAATGGC AGCAATTTCA 
CCAGTGCTGC GGTTAAGGCC GCCTGTTGGT GGGCAGGAGT CAAACAAGAA TTTGGAATTC 
CCTACAATCC CCAAAGTCAA GGAGTAGTAG AATCTATGAA TAAAGAATTA AAGAAAATTA 
TAGGACAGGT AAGAGATCAA GCTGAACATC TTAAGACAGC AGTACAAATG GCAGTATTTG 
TCCACAATTT TAAAAGAAAA GGGGGGATTG GGGGGTACAG TGCAGGGGAA AGAATAATAG 
ACATAATAGC AACAGACATA CAAACTAGAG AACTACAAAA ACAAATTACA AAAATTCAAA 
ATTTTCGGGT TTATTACAGG GACAGCAGAG ATCCACTTTG GAAAGGACCA GCAAAGCTCC 
TCTGGAAAGG TGAAGGGGCA GTAGTAATAC AAGATAATAG TGACATAAAA GTAGTGCCAA 
GAAGAAAAGC AAAGATCATT AGGGATTATG GAAAACAGAT GGCAGGTGAT GATTGTGTGG 
CAAGTAGACA GGATGAGGAT TAGAACATGG AAAAGTTTAG TAAAACATCA TGTATATGTT 
TCAAAGAAAG CTAGGGGATG GTTTTATAGA CATCACTATG AAAGCACTCA TCCAAAAATA 
AGTTCAGAAA TACACATCCC ACTAGGGGAT GCTAGATTGG TAGTAACAAC ATATTGGGGT 
CTGCATACAG GAGAAAGAGA ATGGCATTTG GGTCATGGAG TCTCCATAGA ATGGAGGAAA 
AGGAGCTATA GCACACAAGT AGACCCTGAA CTAGCAGACC AACTAATTCA TCTGTATTAC 
TTTGATTGTT TTTCAGACTC TGCTATAAGA AAGGCCTTAT TAGAACACAT AGTTAGCCCT 
AGGTGTGAAT ATCGAGCAGG ACATTCCAAG GTAGGATCTC TACAATACTT GGCACTATCA 
GCCTTAATAA CACCAAAGAA GATAAAGCCA CCTTTGCCTA GTGTTACGAA ACTGACAGAG 
GATAGATGGA ACAAGCCCCA GAAGACCAAG GGCCACAGAG GGAGCCATAC AATGAATGGA 
CACTAGAGCT TTTAGAGGAG CTTAAGGGAG AAGCTGTTAG ACATTTTCCT AGGCCATGGC 
TCCATAGCTT AGGACAACAT ATCTATGAAA CTTATGGAGA TACTTGGGCA GGAGTGGAAG 
CCATAATAAG AATTCTGCAA CAATTGCTGT TTATTCATTT CAGAATTGGG TGTCAACATA 



3720 
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GCAGAATAGG CATTATTCAA CAGAGGAGAG CAAGAAGAAA TGGAGCCAGT AGATCCTAAT 5400 

CTAGAGCCCT GGAAGCATCC AGGAAGTCAG CCTAAAACTG CTTGTACCAA ATGCTATTGT 5460 

AAAAGGTGTT GCTTTCATTG CCAAGTTTGT TTCACAACAA AAGCCTTAGG CATCTCCTAT 5520 

GGCAGGAAGA AGCGGAGACA GCGACGAAGA CCTCGTCAGG GCAGCCAGGC TCATCAAGCT 5580 

TCTCTATCAG AGCAGTAAGT AGTATATGTA ATGCAACTTA TATTAATTGT AACAATAGTA 5640 

G CTTTAGTAG TAACATTAAT AATAGCAATA GTTGTGTGGT CCATAGTACT CATAGAATAT 5700 

AGGAAAATAT TAAGACAAAG GAAAATAGAC AAGTTAATTA ATAGACTAGT AGAAAGAGCA 5760 

GAAGACAGTG GCAATGAGAG TGAAGGAGAA CTGTCAGCAC TTGTGGAGAT GGGGGTGGAA 5820 

ATGGGGCATC ATGCTCCTTG GGATGTTAAT GATCTGTAGT GCTACAGAAA AATTGTGGGT 5880 

CACAGTCTAT TATGGGGTAC CTGTGTGGAA GGAAACAACT ACCACTCTAT TTTGTGCATC 5940 

AGATGCTAAA GCATATGAAG AAGAGGTACA TAATGTTTGG GCCACACATG CCTGTGTACC 6000 
CACAGACCCC AACCCACAAG AAATAGTATT GGCAAATGTG ACAGAAGATT TTAACATGTG 
GAAAAATGAA ATGGTAGAAC AGATGCATAC TGATATAATC AGTTTATGGG ATGAAAGCCT 
RAAACCATGT GTAAAATTAA CCCCACTCTG TGTTACTTTA AATTGCACTG ATTTGAAGAA 
TGAAACTAAG ACCAATAGTA GTGATGCCAA TAGTAATAGC GGGGAAATAA TGGGGAACGA 
AGAGATAAAA AATTGCTCTT TCAATGTCAG CACAGGCGCA CCAGGTAAGG TGCAGAAAGA 

ATATGCACTT TTTTATGCAC TTGATATAGT ATCAATAAAG AATGAAAATA ATAGTACCAG 6360 

CCATATGTTG ACAAGTTGTA ACACCTCAGT CAGTACACAG GCCTGTCCAA AGGTATCCTT 6420 

TGAGCCAATT CCCATACATT ATTGTGCCCC GGCTGGTTTT GCAATTCTAA AATGTAATGA 6480 

TAAGAAGTTC AATGGAACAG GACCATGTAA CAATGTCAGC ACAGTACAAT GTACACATGG 6540 

AATTAGACCA GTAGTGTCAA CTCAACTGCT GTTAAATGGC AGTGTAGCAG AAGAAGAGGT 6600 

AGTACTTAGA TCTGCCAATT TCTCAGACAA TGCTAAAACC ATAATAGTAC AGCTGAACCA 6660 

CTCTGTAGAA ATTACTTGTA CAAGACCCAA CTACAATGAA ACAAAGAAAA TCCGTATCCA 6720 

CAGAGGATAT GGAAGATCAT TTGTTACAGT AAGAAAATTG GGAGATAGGA AACAAGCACA 6 780 

TTGTACCATG AATAGAACGA AATGGGACAA CGCTTTAAAA CAGATAGCTA GCAAATTAAG 6840 

AGAACAATTT AATAATAAAA CAGCAATAAT CTTTAACCGG TCCTCAGGAG GGGACCTAGA 6900 

AATTGAAATG CACAGTTTTA ATTGCGGAGG GGAATTGTTC TACTGTAATA CAACAAAACT 6960 

GTTTAATAGT ACTTGGAATG AGACTACAGA GTCAAATGGC AAGGGAGAAA ATATCACACT 7020 

CCCATGCAGA ATAAGACAAT TTGTAAACAT GTGGCAGAAA GTAGGAAAAG CAATGTATGC 7080 



6060 
6120 
6180 
6240 
6300 
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CCCTCCCAGC GATGGACAAA TTAGGTGTAC ATCAAATATT ACTGGGCTAC TATTAACAAG 7140 
AGATGGGGGT CATAATGATA ACAACACTAA CAACGAGACC TTCAGACCGG GAAGAGGAGA 
TATGAGGGAC AATTGGAGAA GTGAATTATA TAAATATAAA GTAATAAAAA TTGAACCATT 
AGGAGTAGCA CCCACCAAGG CAAAGAGAAG AGTGGTGCAG AGAGAAAAAA GAGCAGTGGG 
AATGGTAGGA GCTATGTTCC TTGGGTTCTT GGGAGCAGCA GGAAGCACTA TGGGCGCAGC 
GTCATTGACG CTGACGGTAC AGGCCAGACA ATTATTGTCT GGTATAGTGC AGCAGCAGAA 
CAATCTGCTG AGAGCTATTG AGGCGCAACA ACATCTGTTG CAACTCACAG TCTGGGGCAT 
CAAGCAGCTC CAGGCAAGAG TCCTGGCTGT AGAAAGATAC CTAAAGGATC AACAGCTCCT 
GGGGATCTGG GGTTGCTCTG GAAAACTCAT TTGCACCACT ACTGTGCCTT GGAATGCTAG 
TTGGAGTAAT AAATCTTTGG ATCAGATTTG GAATAACATG ACCTGGATGG AGTGGGACAG 
AGAAATTGCC AATTACACAA ACTTAATACA TCACTTAATT GAAGAATCGC AAAACCAGCA 
AGAAAAGAAT GAACAAGAAT TATTGGAATT AGATAAATGG GCAAGTTTGT GGAGTTGGTT 
TGACATATCA AACTGGCTGT GGTATATAAA AATATTCATA ATGATAGTAG CAGGCTTAGT 
AGGTTTAAGA ATAGTTTTTG CTGTGCTTTC TATAGTAAAT AGAGTTAGGC AGGGATACTC 
ACCATTGTCA TTCCAGACCC ACrTCCCAGC TCCGAGGGGA CCCGACAGGC CAGACGGAAT 
CGAAGGAGAA GGTGGAGAGA GAGACAGAGA CAGATCCGTG CGATTAGTGG ATGGATTCTT 
AGCACTTCTC TGGGAAGACC TGCGCAACCT GTGCCTCTTC AGCTACCACC GCTTGAGAGA 
CTTACTCTTG ATTGTAACGA GGATTGTGGA ACTTCTCGGA CGCAGGGGGT GGGAAGCCCT 
CAAATATTTG TGGAGTCTCC TACAGTATTG GAGTCAGGAG CTAAAGAATA GTGCTGTCAA 
CTTGTTCAAT ACCACAGCTA TAGTAGTAGC TGAGGGGACA GATAGGATCA TAGAAGTAGT 
ACAAAGACTT TGTAGAGCTA TTCTCCACAT ACCTAGAAGA ATTAGACAGG GCTTGGAAAG 
GATTTTGCTA TAAGATGGGT GGCAAGTGGT CAAAAAGTAG TATAGTTGGA TGGCCTACTA 
TAAGGGAAAG AATGAAACGA GCTGGACCAG CAGCAGATGG GGTGGGAGCA GCATCTCGAG 
ACCTAGAAAA ACATGGAGCA ATCACAAGTA GCAATACAGC AGCTACCAAT GCTGATTGTG 
CCTGGCTAGA AGCACAAGAG GAGGAAGAGG TGGGTTTTCC AGTCAGACCT CAGGTACCTT 
TAAGACCAAT GACTTACAAG GCAGGTATAG ATCTTAGCCA CTTTTTAAAA GAAAAGGGGG 
GACTGGAAGG GCTAGTTTGG TCCCAAAGAA GACAAGATAT CCTTGATCTG TGGATCTACC 
ACACACAAGG CTACTTCCCT GATTGGCAGA ACTACACACC AGGGCCAGGG ATCAGATATC 
CACTGACCTT TGGATGGTGC TTCAAGCTAG TACCAGTTGA GCCAGATAAG GTAGAAGAGG 



7200 
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8880 


rrAATGAAGG 


AGAGGACAAC 


ATCTTGTTAC 


ACCCTATGTG 


TCTACATGGA 


ATGGAGGACG 


CGGAGAAAGA 


AGTGTTAGTG 


TGGAGATTTG 


ACAGTAAACT 


AGCCTTCCAT 


CACGTAGCCC 


8940 


GAGAGCTGCA 


TCCGGAGTAC 


TACAAAGACT 


GCTGACATCG 


AGCTTTCTAC 


AAGGGACTTT 


9000 


rCGCTGGGGA 


CTTTCCAGGG 


AGGCGTGATC 


TGGGTGGGAC 


TGGGGAGTGG 




9060 


GATGCTGCAT 


ATAAGCAGCT 


GCTTTTGCCT 


GTACTGGGTC 


TCTCTGGTTA 


GACCAGATCT 


9120 


GAGCCTGGGA 


GCTCTCTGGC 


TAGCTAGGGA 


ACCCACTGCT 


TAAGCCTCAA 


TAAAGCTTGC 


9180 
9193 


CTTGAGTGCT 


TCA 
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:cactgcttaagcc 
rtggtaactagaga 
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g aa « cccca "*"""aat catctt aa gacagcageacaaa tggcagtacctgtccacaattttaaa 
acaggcaagagaccaagctgaacaccccaagac g y ^^^^ acaaca ^ caacagacacacaaa 



actttggaaa g gaccagcaaagccccccci,_- a ». a _-. acggaaaacagacggcaggcgacgacc 



■ggaaaggtgaaggggcagtagtaatacaagataatagcgac 



atgaaacttatggagatact 



gcaagcagtatatgtaacgcaaccua * aggaaa aeattaagacaaaggaaaatagacaagt 

gcaacagctgcgcggtccatagtactcatagaatacagga * ta ccagca cttgt 

caatcaatagactagtagaaagagcagaagac^ 
ggagatgggggtggaaatggggcatcacgctc^ 

gtgggtcacagtctattacggggtacctgcgcggaagg a gtgta cccacagaccccaacc 
gccaaagcatacgaagaagaggcacacaacgttcgggccacac^ 

cacaagaaatagtattggcaaatgtgacagaag^ 

gcatactgatataaccagcctatgggacgaaagcctaaaaccatgtgca ggg 
actttaaattgcactgatctgaagaatgaaac^g^^ 

aaataacggggaac_g^ 




cacattattgtgccccggctgg^-y---------- » - aCTt . aatat : Ca actcaactgctgtta 



aatggcagtccagcagaagaagaggtagtact^ 
tagtacagccgaaccactctgcagaaaccacttgcac g g aa acaagcacaccgt 
tacccacagaggatacggaagaccacttgtcacagtaagaaaaccggg | ^ aCtcaaCa 
accacgaacagaacgaaacgggaoaacg^ 
aaacagcaacaatctttaaccggtccccagg^ggg^ 



9g < a !!!?!!""""c"g=ga^ 



^^^^^^^^ 



c 



r a «tcacttaattgaagaatcgcaaaaccagcaagaaaagaatgaacaagaaccattgga 

Sgrrs 



95 



SUBSTITUTE SHEET (RULE 26) 



PCT/US98/12990 

WO 98/59074 

ltggactcttagcactt 



"ggagccaggagctaaagaatagcgc g actctccacacacccaga agaactagac 
acagacaggatcatagaagcagtacaaaga^ 

agggcccggaaagacccttgctacaagacgggcgg « « agcatcccgagacctaga 
ccacaagggaaagaargaaacgagctggaccagcagcag ^ cc CC a ga agcacaa 

aaaacatcgagcaatcacaagtagcaatacagcagc^ 
gaggaggaagaggtgggctttccagtcagaccccaggt^ 



ca?cctcgatctgtggatctaccacaca = ^ 

gtgtggagatccgacagtaaactagccttccaccacg g * tcccagggaggcgtg 



agaagtgccagtgcggag^u^^-,.---- - cc cgccggggac tt C ccagggagg C g C g 

tactacaaagaccgctgacatcgagccttctaca ^ taagcagccgctctcgC ccgcaccgg 

caataaagcttgccttgagtgcttca 
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OR SEQ ID NO: 13 THROUGH SEQ ID NO:l7, PLEASE SEE TABLE 



(2) INFORMATION FOR SEQ ID NO: 18 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Oligonucleotide." 

(iii) HYPOTHETICAL: NO 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18 
AAATCTCTAG CAGTGGCGCC CGAACAG 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Oligonucleotide." 

(iii) HYPOTHETICAL: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
GCACTCAAGG CAAGCTTTAT TGAGGCT 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Oligonucleotide." 

(iii) HYPOTHETICAL: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
CACACACAAG GCTACTTCCC TGATTGGCAG A 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 33 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Oligonucleotide." 

(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
ATGGAACAAG CCCCAGAAGA CCAAGGGCCA CAG 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc - "Oligonucleotide." 

(iii) HYPOTHETICAL: NO 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 

GGTCTGAGGG ATCTCTAGTT ACCAGAGTCA C 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2404 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) . TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

Met Glu Thr Ala Arg Gly Val Ala Leu Leu Tyr Ser Gly Leu Ala Ser 
! 5 10 15 

Asn Thr Tyr Arg Gly Leu Asn His lie Ser Leu Glu Thr Arg Pro Ala 
20 25 30 

Arg Gly Thr Arg Pro Gly Leu Tyr Thr Arg Pro Leu Tyr Ser Thr Arg 
35 40 45 

Pro Gly Leu Tyr lie Leu Glu Met Glu Thr Leu Glu Leu Glu Gly Leu 
50 55 60 



31 
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Tvr Met Glu Thr Leu Glu Met Glu Thr He Leu Glu Cys Tyr Ser Ser 

7n 75 80 



65 



Glu Arg Ala Leu Ala Thr His Arg Gly Leu Leu Tyr Ser Leu Glu Thr 
85 9° 95 

Arq Pro val Ala Leu Thr His Arg Val Ala Leu Thr Tyr Arg Thr Tyr 
100 105 11° 

Arg Gly Leu Tyr Val Ala Leu Pro Arg Val Ala Leu Thr Arg Pro Leu 
115 120 I 25 

Tyr Ser Gly Leu Ala Leu Ala Thr His Arg Thr His Arg Thr His Arg 



130 



135 14° 



Glu Pro His Glu Cys Tyr Ser Ala Leu Ala Ser Glu Arg Ala Ser 



Leu 

145 150 



155 16° 



Pro Ala Leu Ala Leu Tyr Ser Ala Leu Ala Thr Tyr Arg Gly Leu Gly 
165 l™ i7S 

Leu Gly Leu Val Ala Leu His lie Ser Ala Ser Asn Val Ala Leu Thr 
180 185 190 

Arg Pro Ala Leu Ala Thr His Arg His lie Ser Ala Leu Ala Cys Tyr 
1 95 200 205 

Ser Val Ala Leu Pro Arg Thr His Arg Ala Ser Pro Pro Arg Ala Ser 
210 215 220 

Asn Pro Arg Gly Leu Asn Gly Leu He Leu Glu Val Ala Leu Leu Glu 
225 230 235 240 

Ala Leu Ala Ala Ser Asn Val Ala Leu Thr His Arg Gly Leu Ala Ser 
245 250 255 

Pro Pro His Glu Ala Ser Asn Met Glu Thr Thr Arg Pro Leu Tyr Ser 
260 265 270 

Ala Ser Asn Gly Leu Met Glu Thr Val Ala Leu Gly Leu Gly Leu Asn 
275 280 285 

Met Glu Thr His He Ser Thr His Arg Ala Ser Pro He Leu Glu lie 
290 295 300 

Leu Glu ser Glu Arg Leu Glu Thr Arg Pro Ala Ser Pro Gly Leu Ser 

305 310 315 320 

Glu Arg Leu Glu Leu Tyr Ser Pro Arg Cys Tyr Ser Val Ala Leu Leu 
325 330 335 

Tyr Ser Leu Glu Thr His Arg Pro Arg Leu Glu Cys Tyr Ser Val Ala 
340 345 350 

Leu Thr His Arg Leu Glu Ala Ser Asn Cys Tyr Ser Thr His Arg Ala 
355 360 365 
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Ser Pro Leu Glu Leu Tyr Ser Ala Ser Asn Gly Leu Thr His Arg Leu 
370 375 380 

Tyr Ser Thr His Arg Ala Ser Asn Ser Glu Arg Ser Glu Arg Ala Ser 
385 390 395 400 

Pro Ala Leu Ala Ala Ser Asn Ser Glu Arg Ala Ser Asn Ser Glu Arg 
405 410 415 

Gly Leu Tyr Gly Leu He Leu Glu Met Glu Thr Gly Leu Tyr Ala Ser 
420 425 430 

Asn Gly Leu Gly Leu lie Leu Glu Leu Tyr Ser Ala Ser Asn Cys Tyr 
435 440 445 

Ser Ser Glu Arg Pro His Glu Ala Ser Asn Val Ala Leu Ser Glu Arg 
450 455 460 

Thr His Arg Gly Leu Tyr Ala Leu Ala Pro Arg Gly Leu Tyr Leu Tyr 

A~rn 475 480 

465 470 

Ser Val Ala Leu Gly Leu Asn Leu Tyr Ser Gly Leu Thr Tyr Arg Ser 
485 490 495 

Glu Arg Leu Glu Pro His Glu Thr Tyr Arg Ala Leu Ala Leu Glu Ala 
500 505 510 

Ser Pro He Leu Glu Val Ala Leu Ser Glu Arg He Leu Glu Leu Tyr 
515 520 525 

Ser Ala Ser Asn Gly Leu Ala Ser Asn Ala Ser Asn Ser Glu Arg Thr 
530 535 540 

His Arg Ser Glu Arg His lie Ser Met Glu Thr Leu Glu Thr His Arg 
545 550 555 560 

Ser Glu Arg Cys Tyr Ser Ala Ser Asn Thr His Arg Ser Glu Arg Val 
565 570 575 

Ala Leu Ser Glu Arg Thr His Arg Gly Leu Asn Ala Leu Ala Cys Tyr 
580 585 590 

Ser Pro Arg Leu Tyr Ser Val Ala Leu Ser Glu Arg Pro His Glu Gly 
595 600 605 

Leu Pro Arg lie Leu Glu Pro Arg lie Leu Glu His He Ser Thr Tyr 
610 615 620 



Arg Cys Tyr Ser Ala Leu Ala Pro Arg Ala Leu Ala Gly Leu Tyr Pro 
625 



630 635 640 



His Glu Ala Leu Ala He Leu Glu Leu Glu Leu Tyr Ser Cys Tyr Ser 
645 650 655 

Ala Ser Asn Ala Ser Pro Leu Tyr Ser Leu Tyr Ser Pro His Glu Ala 
660 665 670 
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Ser Asn Gly Leu Tyr Thr His Arg Gly Leu Tyr Pro Arg Cys Tyr Ser 
675 680 685 

Ala Ser Asn Ala Ser Asn Val Ala Leu Ser Glu Arg Thr His Arg Val 



690 



695 



700 



Ala Leu Gly Leu Asn Cys Tyr Ser Thr His Arg His lie Ser Gly Leu 



705 



710 715 "0 



Tyr lie Leu Glu Ala Arg Gly Pro Arg Val Ala Leu Val Ala Leu Ser 

Y 725 730 735 

Glu Arg Thr His Arg Gly Leu Asn Leu Glu Leu Glu Leu Glu Ala Ser 
740 745 750 

Gly Leu Tyr Ser Glu Arg Leu Glu Ala Leu Ala Gly Leu Gly Leu 



Asn 

755 



760 765 



Gly Leu Val Ala Leu val Ala Leu Leu Glu Ala Arg Gly Ser Glu Arg 
770 775 780 



Ala Leu Ala Ala Ser Asn Pro His Glu Ser Glu Arg Ala Ser Pro Ala 
785 790 795 



Ser Asn Ala Leu Ala Leu Tyr Ser Thr His Arg lie Leu Glu lie Leu 
805 810 815 

Glu val Ala Leu Gly Leu Asn Leu Glu Ala Ser Asn His lie Ser Ser 
820 825 830 

Glu Arg val Ala Leu Gly Leu lie Leu Glu Thr His Arg Cys Tyr Ser 
835 840 845 

Thr His Arg Ala Arg Gly Pro Arg Ala Ser Asn Thr Tyr Arg Ala Ser 
850 855 860 

Asn Gly Leu Thr His Arg Leu Tyr Ser Leu Tyr Ser He Leu Glu Ala 
865 870 875 

Arg Gly He Leu Glu His lie Ser Ala Arg Gly Gly Leu Tyr Thr Tyr 
9 885 890 895 

Arg Gly Leu Tyr Ala Arg Gly Ser Glu Arg Pro His Glu Val Ala Leu 
900 9° 5 910 



Thr His Arg 
915 



val Ala Leu Ala Arg Gly Leu Tyr Ser Leu Glu Gly Leu 
920 925 

Tyr Ala Ser Pro Ala Arg Gly Leu Tyr Ser Gly Leu Asn Ala Leu Ala 
930 «5 940 

His lie Ser Cys Tyr Ser Thr His Arg Met Glu Thr Ala Ser Asn Ala 
945 950 955 

Arg Gly Thr His Arg Leu Tyr Ser Thr Arg Pro Ala Ser Pro Ala Ser 
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Asn Ala Leu Ala Leu Glu Leu Tyr Ser Gly Leu Asn lie Leu Glu Ala 
980 985 99° 

Leu Ala Ser Glu Arg Leu Tyr Ser Leu Glu Ala Arg Gly Gly Leu Gly 
99S 1000 1° 05 

Leu Asn Pro His Glu Ala Ser Asn Leu Tyr Ser Thr His Arg Ala Leu 
1010 1015 1020 

Ala lie Leu Glu lie Leu Glu Pro His Glu Ala Ser Asn Ala Arg Gly 
1025 1030 1035 "40 

Ser Glu Arg Ser Glu Arg Gly Leu Tyr Gly Leu Tyr Ala Ser Pro Leu 
1045 1050 1055 

Glu Gly Leu lie Leu Glu Gly Leu Met Glu Thr His lie Ser Ser Glu 
1060 1065 1070 

Arg Pro His Glu Ala Ser Asn Cys Tyr Ser Gly Leu Tyr Gly Leu Tyr 
1075 1080 1085 

Gly Leu Leu Glu Pro His Glu Thr Tyr Arg Cys Tyr Ser Ala Ser Asn 
1090 1095 HOC 

Thr His Arg Thr His Arg Leu Tyr Ser Leu Glu Pro His Glu Ala Ser 
1105 mo 1115 1120 

Asn Ser Glu Arg Thr His Arg Thr Arg Pro Ala Ser Asn Gly Leu Thr 
1125 H30 I" 5 

His Arg Thr His Arg Gly Leu Ser Glu Arg Ala Ser Asn Gly Leu Tyr 
1140 H45 H50 

Leu Tyr Ser Gly Leu Tyr Gly Leu Ala Ser Asn He Leu Glu Thr His 
1155 H60 H 65 

*rg Leu Glu Pro Arg Cys Tyr Ser Ala Arg Gly lie Leu Glu Ala Arg 
1170 11" I 180 

Gly Gly Leu Asn Pro His Glu Val Ala Leu Ala Ser Asn Met Glu Thr^ 
1185 H90 II 95 

Thr Arg Pro Gly Leu Asn Leu Tyr Ser Val Ala Leu Gly Leu Tyr Leu 
1205 1210 "l 5 

Tyr ser Ala Leu Ala Met Glu Thr Thr Tyr Arg Ala Leu Ala Pro Arg 
1220 1225 1230 

Pro Arg Ser Glu Arg Ala Ser Pro Gly Leu Tyr Gly Leu Asn lie Leu 
1235 1240 1245 

Glu Ala Arg Gly Cys Tyr Ser Thr His Arg Ser Glu Arg Ala Ser Asn 
1250 1255 1260 

He Leu Glu Thr His Arg^Gly Leu Tyr Leu Glu^Leu Glu Leu Glu Thr^ 
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His Arg Ala Arg Gly Ala Ser Pro Gly Leu Tyr Gly Leu Tyr Gly Leu 
1285 1290 1295 

Tyr Pro Arg Ser Glu Arg Ala Ser Pro Ala Ser Asn Leu Tyr Ser Ala 
1300 1305 1310 

Ser Pro Leu Tyr Ser Gly Leu Thr His Arg Pro His Glu Ala Arg Gly 
1315 1320 1325 

Pro Arg Gly Leu Tyr Gly Leu Tyr Gly Leu Tyr Ala Ser Pro Met Glu 
1330 1335 1340 

Thr Ala Arg Gly Ala Ser Pro Ala Ser Asn Thr Arg Pro Ala Arg Gly 
1345 1350 1355 1360 

Ser Glu Arg Gly Leu Leu Glu Thr Tyr Arg Leu Tyr Ser Thr Tyr Arg 
1365 1370 1375 

Leu Tyr Ser Val Ala Leu lie Leu Glu Leu Tyr Ser He Leu Glu Gly 
1380 1385 1390 

Leu Pro Arg Leu Glu Gly Leu Tyr Val Ala Leu Ala Leu Ala Pro Arg 
1395 1400 I 405 

Thr His Arg Leu Tyr Ser Ala Leu Ala Leu Tyr Ser Ala Arg Gly Ala 
14 10 1415 1420 

Ara Gly val Ala Leu Val Ala Leu Gly Leu Asn Ala Arg Gly Gly Leu 
14 25 1430 1435 1440 

Leu Tyr Ser Ala Arg Gly Ala Leu Ala Val Ala Leu Gly Leu Tyr Met 
1445 1450 I 455 

Glu Thr Val Ala Leu Gly Leu Tyr Ala Leu Ala Met Glu Thr Pro His 
1460 1465 1470 

fllu Leu Glu Glv Leu Tyr Pro His Glu Leu Glu Gly Leu Tyr Ala Leu 
1475 1480 1485 

Ala Ala Leu Ala Gly Leu Tyr Ser Glu Arg Thr His Arg Met Glu Thr 
1490 1495 1500 

Gly Leu Tyr Ala Leu Ala Ala Leu Ala Ser Glu Arg Leu Glu Thr His 
1505 1510 1515 1520 

Arg Leu Glu Thr His Arg Val Ala Leu Gly Leu Asn Ala Leu Ala Ala 
1525 1530 1535 

Arg Gly Gly Leu Asn Leu Glu Leu Glu Ser Glu Arg Gly Leu Tyr lie 
1540 1545 1550 

Leu Glu Val Ala Leu Gly Leu Asn Gly Leu Asn Gly Leu Asn Ala Ser 
I555 1560 1565 

Asn Ala Ser Asn Leu Glu Leu Glu Ala Arg Gly Ala Leu Ala He Leu 
1570 1575 1580 
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Glu Gly Leu Ala Leu Ala Gly Leu Asn Gly Leu Asn His lie Ser Leu 
1585 1590 1S95 160° 

Glu Leu Glu Gly Leu Asn Leu Glu Thr His Arg Val Ala Leu Thr Arg 
160S 1610 1615 

Pro Gly Leu Tyr He Leu Glu Leu Tyr Ser Gly Leu Asn Leu Glu Gly 
1620 1625 1630 

Leu Asn Ala Leu Ala Ala Arg Gly Val Ala Leu Leu Glu Ala Leu Ala 
1635 1640 1645 

Val Ala Leu Gly Leu Ala Arg Gly Thr Tyr Arg Leu Glu Leu Tyr Ser 
1650 1655 1660 

Ala Ser Pro Gly Leu Asn Gly Leu Asn Leu Glu Leu Glu Gly Leu Tyr 
1665 1670 1675 1680 

He Leu Glu Thr Arg Pro Gly Leu Tyr Cys Tyr Ser Ser Glu Arg Gly 
1685 1690 1695 

Leu Tyr Leu Tyr Ser Leu Glu lie Leu Glu Cys Tyr Ser Thr His Arg 
1700 1705 1710 

Thr His Arg Thr His Arg Val Ala Leu Pro Arg Thr Arg Pro Ala Ser 
1715 1720 1725 

Asn Ala Leu Ala Ser Glu Arg Thr Arg Pro Ser Glu Arg Ala Ser Asn 
1730 1735 1740 

Leu Tyr Ser Ser Glu Arg Leu Glu Ala Ser Pro Gly Leu Asn He Leu 
1745 1750 1755 1760 

Glu Thr Arg Pro Ala Ser Asn Ala Ser Asn Met Glu Thr Thr His Arg 
1765 1770 1775 

Thr- Arg Pro Leu Glu Gly Leu Thr Arg Pro Ala Ser Pro Ala Arg Gly 
1780 1785 1790 

Gly Leu He Leu Glu Ala Leu Ala Ala Ser Asn Thr Tyr Arg Thr His 
1795 1800 180S 

Arg Ala Ser Asn Leu Glu lie Leu Glu His lie Ser His lie Ser Leu 
1810 1815 1820 

Glu He Leu Glu Gly Leu Gly Leu Ser Glu Arg Gly Leu Asn Ala Ser 
1825 1830 1835 1840 

Asn Gly Leu Asn Gly Leu Asn Gly Leu Leu Tyr Ser Ala Ser Asn Gly 
1845 1850 1855 

Leu Gly Leu Asn Gly Leu Leu Glu Leu Glu Gly Leu Leu Glu Ala Ser 
1860 1865 1870 

Pro Leu Tyr Ser Thr Arg Pro Ala Leu Ala Ser Glu Arg Leu Glu Thr 
1875 1880 1885 
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Arg Pro Ser Glu Arg Thr Arg Pro Pro His Glu Ala Ser Pro lie Leu 
1890 1895 190° 

Glu Ser Glu Arg Ala Ser Asn Thr Arg Pro Leu Glu Thr Arg Pro Thr 
1905 19" 1915 "SO 

Tvr Arg lie Leu Glu Leu Tyr Ser lie Leu Glu Pro His Glu He Leu 
1925 1930 1935 

Glu Met Glu Thr lie Leu Glu Val Ala Leu Ala Leu Ala Gly Leu Tyr 
1940 1945 I 950 

Leu Glu Val Ala Leu Gly Leu Tyr Leu Glu Ala Arg Gly He Leu Glu 
1955 I960 1955 

val Ala Leu Pro His Glu Ala Leu Ala Val Ala Leu Leu Glu Ser Glu 
1970 1975 1980 

Arg lie Leu Glu Val Ala Leu Ala Ser Asn Ala Arg Gly Val Ala Leu 
1985 1990 1995 2000 

Ala Arg Gly Gly Leu Asn Gly Leu Tyr Thr Tyr Arg Ser Glu Arg Pro 
2005 2010 2015 

Arg Leu Glu Ser Glu Arg Pro His Glu Gly Leu Asn Thr His Arg His 
2020 2025 2030 

lie Ser Pro His Glu Pro Arg Ala Leu Ala Pro Arg Ala Arg Gly Gly 

2035 2040 2045 

Leu Tyr Pro Arg Ala Ser Pro Ala Arg Gly Pro Arg Ala Ser Pro Gly 
2050 2055 2060 

Leu Tyr He Leu Glu Gly Leu Gly Leu Tyr Gly Leu Gly Leu Tyr Gly 

2065 2070 2075 

Leu Tyr Gly Leu Ala Arg Gly Ala Ser Pro Ala Arg Gly Ala Ser Pro 
2085 2090 

Ala Arg Gly Ser Glu Arg Val Ala Leu Ala Arg Gly Leu Glu Val Ala 
2100 2105 2110 

Leu Ala Ser Pro Gly Leu Tyr Pro His Glu Leu Glu Ala Leu Ala Leu 
211S 2120 2125 

Glu Leu Glu Thr Arg Pro Gly Leu Ala Ser Pro Leu Glu Ala Arg Gly 
2130 2135 2140 

Ala Ser Asn Leu Glu Cys Tyr Ser Leu Glu Pro His Glu Ser Glu Arg 
2145 2150 2155 2160 

Thr Tyr Arg His lie Ser Ala Arg Gly Leu Glu Ala Arg Gly Ala Ser 
2165 2170 2175 

Pro Leu Glu Leu Glu Leu Glu lie Leu Glu Val Ala Leu Thr His Arg 
2180 2185 2190 



105 



SUBSTITUTE SHEET (RULE 26) 



PCT7US98/12990 

WO 98/59074 

Ala Arg Gly lie Leu Glu Val Ala Leu Gly Leu Leu Glu Leu Glu Gly 
2195 2200 2205 

Leu Tyr Ala Arg Gly Ala Arg Gly Gly Leu Tyr Thr Arg Pro Gly Leu 
2210 22 1S 2220 

Ala Leu Ala Leu Glu Leu Tyr Ser Thr Tyr Arg Thr Arg Pro Thr Arg 
2225 "30 "35 2240 

Pro Ser Glu Arg Leu Glu Leu Glu Gly Leu Asn Thr Tyr Arg Thr Arg 
2245 2250 2255 

Pro Ser Glu Arg Gly Leu Asn Gly Leu Leu Glu Leu Tyr Ser Ala Ser 
2260 2 265 2270 

Asn Ser Glu Arg Ala Leu Ala Val Ala Leu Ala Ser Asn Leu Glu Pro 
2275 2280 2285 

His Glu Ala Ser Asn Thr His Arg Ala Arg Gly Ala Leu Ala lie Leu 
2290 2295 "00 

Glu Val Ala Leu Val Ala Leu Ala Leu Ala Gly Leu Gly Leu Tyr Thr 
2305 2310 2315 "20 

His Arg Ala Ser Pro Ala Arg Gly He Leu Glu He Leu Glu Gly Leu 
232S 2330 2335 

Val Ala Leu Val Ala Leu Gly Leu Asn Ala Arg Gly Leu Glu Cys Tyr 

2340 "45 2350 

Ser Ala Arg Gly Ala Leu Ala lie Leu Glu Leu Glu His He Ser lie 
2355 2360 2365 

Leu Glu Pro Arg Ala Arg Gly Ala Arg Gly lie Leu Glu Ala Arg Gly 
2370 2375 2380 

Gly Leu Asn Gly Leu Tyr Leu Glu Gly Leu Ala Arg Gly Pro His Glu^ 
2385 2390 2395 

Leu Glu Leu Glu 
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1 An isolated human immunodeficiency virus type 1 (HIV- 1 ), which is HIV- 1 JO 

identified by a nucleotide sequence as given in SEQ ID NO:ll, or which is HIV-1 NC , 
identified by a nucleotide sequence as given in SEQ ID NO: 12. 

2. A biological sample comprising the HIV- 1 JC of claim 1 . 

3 A biological sample comprising the HIV- 1 NC of claim 1 . 

4. The biological sample of claim 1 wherein said sample is a sample of blood. 

5 A biologically pure culture of host cells comprising the HIV- 1 JC or HI V- 1 NC of claim 
1. 

6. The culture of claim 5 wherein the host cells are peripheral blood mononuclear cells. 

7. A composition comprising an antigenic preparation of the HIV- 1 JC of claim 1 . 

8. A composition comprising an antigenic preparation of the HIV- 1 NC of claim 1 . 

9. A kit for detecting the presence of HIV- 1 antibodies comprising an antigenic 
preparation of the HTV-1, C or HIV-1 NC of claim 1. 

10. An immunogenic composition comprising an antigenic preparation of the HIV- 1 K or 
HTV-1 NC of claim 1 and a pharmaceutically acceptable carrier. 



11. A vaccine 



comprising an antigenic preparation of the HIV-1 JC or HTV-1 NC of claim 1. 



12. An isolated DNA molecule comprising a nucleotide sequence encoding an infectious 
molecular clone for HIV-1 JC or an antigenic fragment thereof or for HIV-1 NC or an 
antigenic fragment thereof. 
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13. 



The isolated DNA molecule of claim 12, wherein the DNA comprises a nucleotide 
sequence encoding an HIV-1 JC envelope protein as given in SEQ ID NO:2, an HIV-1 JC 
gag, nef or p24 protein, or an antigenic fragment of any of the foregoing.. 

1 4. The isolated DNA molecule of claim 1 3, wherein the DNA comprises the nucleotide 
sequence of SEQ ID NO: 1 or a portion thereof specifying an antigenic fragment of 
said HIV-1 JC envelope protein. 



15. 



The isolated DNA molecule of claim 12, wherein the DNA comprises a nucleotide 
sequence encoding an HIV-1 NC envelope protein as given in SEQ ID NO:23, an HIV- 

'nc 



1 NC gag, nef or p24 protein, or an antigenic fragment of any of the foregoing. 



16. A method of inducing antibodies to HIV- 1 JC in a nonhuman mammalian subject 
comprising the steps of (a) administering to the subject an immunogenic amount of an 
antigenic preparation of the HIV-1 JC and optionally (b) harvesting said antibodies to 
HIV-1 K . 

1 7. The method of claim 1 6 wherein said subject is a primate. 

18. A method of inducing antibodies to HIV-1 NC in a nonhuman mammalian subject 
comprising the steps of (a) administering to the subject an immunogenic amount of an 
antigenic preparation of the HIV-1 NC and optionally (b) harvesting said antibodies to 
HIV-1 NC . 

19. The method of claim 1 8 wherein said subject is a primate. 

20. A method of immunizing a subject against the development of acquired immune 
deficiency syndrome (AIDS) comprising the step of administering to said subject an 
immunogenic amount of an antigenic preparation of HIV-1 JC or HIV-1 NC such that 
antibodies directed to HIV-1 JC or HIV-1 NC are produced in said subject and thereby 
symptoms of AIDS are diminished or prevented. 

21. The method of claim 20 wherein said subject is a primate. 
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22, A method for inducing acquired immune deficiency syndrome (AIDS) in a nonhuman 
primate, comprising the step of administering to said primate an effective amount of 
an infections preparation of the H1V-1 of claim 1 such that said primate develops 
AIDS. 

23. The method of claim 22 wherein said infections preparation of HIV- 1 is a biological 
specimen obtained from a nonhuman primate infected with HIV-1 JC and exhibiting 
AIDS. 



24. 



The method of claim 22 wherein said infections preparation of HIV-1 is a biological 
specimen obtained from a nonhuman primate infected with HIV-1 NC and exhibiting 
AIDS. 

25. A method of screening for HIV- 1 in a biological sample comprising the step of 
introducing into said biological sample a hybridization probe comprising a nucleotide 
sequence comprising at least about 15 contiguous nucleotides from SEQ ID NO:l 1 or 
a sequence complementary thereto or from SEQ ID NO: 12 or a nucleotide sequence 
complementary thereto under stringent conditions such that said hybridization probe 
binds to an HIV-1 JC gene or a nucleotide sequence having at least about 95% 
nucleotide sequence identity thereto or an HIV-1 NC gene or a nucleotide sequence 
having at least about 95% nucleotide sequence identity thereto. 

26. A nonhuman primate model infected with at least one HIV- 1 of claim 1 and exhibiting 
symptoms of AIDS, useful for the development of a drug or vaccine for the treatment 
or prevention of AIDS. 

27. The primate model of claim 26 wherein said primate is a chimpanzee or a macaque. 

28. The primate model of claim 27 wherein said HIV- 1 is HTV-1 1C . 

29. The primate model of claim 27 wherein said HIV-1 is HTV-1 NC . 
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